Gratiuale Texts 


inMathematics 


Serge Lang 


Differential 
and Riemannian 
Manifolds 


);) Springer 


Graduate Texts in Mathematics 160 


Editorial Board 
S. Axler F.W. Gehring P.R. Halmos 


Springer 
New York 
Berlin 
Heidelberg 
Barcelona 
Budapest 
Hong Kong 
London 
Milan 

Paris 

Santa Clara 
Singapore 
Tokyo 


BOOKS OF RELATED INTEREST BY SERGE LANG 


Linear Algebra, Third Edition 
1987, ISBN 96412-6 


Undergraduate Algebra, Second Edition 
1990, ISBN 97279-X 


Complex Analysis, Third Edition 
1993, ISBN 97886-0 


Real and Functional Analysis, Third Edition 
1993, ISBN 94001-4 


Algebraic Number Theory, Second Edition 
1994, ISBN 94225-4 


Introduction to Complex Hyperbolic Spaces 
1987, ISBN 96447-9 


OTHER BOOKS BY LANG PUBLISHED BY 
SPRINGER- VERLAG 


Introduction to Arakelov Theory ¢ Riemann-Roch Algebra (with William Fulton) ¢ 
Complex Multiplication © Introduction to Modular Forms ¢ Modular Units (with Daniel 
Kubert) © Fundamentals of Diophantine Geometry ¢ Elliptic Functions «© Number 
Theory III ¢ Cyclotomic Fields I and II ¢ SL,(R) ¢ Abelian Varieties ¢ Introduction to 
Algebraic and Abelian Functions ¢ Undergraduate Analysis ¢ Elliptic Curves: Diophan- 
tine Analysis ¢ Introduction to Linear Algebra ¢ Calculus of Several Variables © First 
Course in Calculus ¢ Basic Mathematics ¢ Geometry: A High School Course (with Gene 
Murrow) © Math! Encounters with High School Students ¢ The Beauty of Doing 


Mathematics © THE FILE 


Serge Lang 


Differential and 
Riemannian Manifolds 


With 20 Illustrations 


)) Springer 


Serge Lang 

Department of Mathematics 
Yale University 

New Haven, CT 06520 


USA 

Editorial Board 

S. Axler F.W. Gehring 

Department of Department of 
Mathematics Mathematics 


Michigan State University 
East Lansing, MI 48824 
USA 


University of Michigan 
Ann Arbor, MI 48109 
USA 


P.R. Halmos 

Department of 
Mathematics 

Santa Clara University 

Santa Clara, CA 95053 

USA 


Mathematics Subject Classifications Code: 58-01 


Library of Congress Cataloging-in-Publication Data 
Lang, Serge, 1927- 
Differential and Riemannian manifolds / Serge Lang. 
cm. — (Graduate texts in mathematics ; 160) 
Includes bibliographical references (p. -  ) and index. 
ISBN-13: 978-1-4612-8688-2 


1. Differentiable manifolds. 2. Riemannian manifolds. I. Title. 
II. Series. 
QA614.3.L34 1995b 
516.3'6—dc20 95-1594 


This is the third edition of Differential Manifolds, originally published by Addison-Wesley 
in 1962. 


Printed on acid-free paper. 


© 1995 Springer-Verlag New York, Inc. 
Softcover reprint of the hardcover 3rd edition 1995 


All rights reserved. This work may not be translated or copied in whole or in part without the 
written permission of the publisher (Springer-Verlag New York, Inc., 175 Fifth Avenue, New 
York, NY 10010, USA) except for brief excerpts in connection with reviews or scholarly 
analysis. Use in connection with any form of information storage and retrieval, electronic 
adaptation, computer software, or by similar or dissimilar methodology now known or 
hereafter developed is forbidden. 

The use of general descriptive names, trade names, trademarks, etc., in this publication, even 
if the former are not especially identified, is not to be taken as a sign that such names, as 
understood by the Trade Marks and Merchandise Marks Act, may accordingly be used freely 
by anyone. 


Production coordinated by Brian Howe and managed by Terry Kornak; manufacturing super- 


vised by Jeffrey Taub. 
Typeset by Asco Trade Typesetting Ltd., Hong Kong. 


9 8 7 6 5 4 3 2 (Second corrected printing, 1996) 


ISBN-13: 978-1-4612-8688-2 e-ISBN-13: 978-1-4612-4182-9 
DOI: 10.1007/978-1-4612-4182-9 


Preface 


This is the third version of a book on differential manifolds. The first 
version appeared in 1962, and was written at the very beginning of 
a period of great expansion of the subject. At the time, I found no 
satisfactory book for the foundations of the subject, for multiple reasons. 
I expanded the book in 1971, and I expand it still further today. 
Specifically, I have added three chapters on Riemannian and pseudo 
Riemannian geometry, that is, covariant derivatives, curvature, and some 
applications up to the Hopf—-Rinow and Hadamard—Cartan theorems, as 
well as some calculus of variations and applications to volume forms. I 
have rewritten the sections on sprays, and I have given more examples of 
the use of Stokes’ theorem. I have also given many more references to 
the literature, all of this to broaden the perspective of the book, which I 
hope can be used among things for a general course leading into many 
directions. The present book still meets the old needs, but fulfills new 
ones. 

At the most basic level, the book gives an introduction to the basic 
concepts which are used in differential topology, differential geometry, 
and differential equations. In differential topology, one studies for instance 
homotopy classes of maps and the possibility of finding suitable 
differentiable maps in them (immersions, embeddings, isomorphisms, etc.). 
One may also use differentiable structures on topological manifolds to 
determine the topological structure of the manifold (for example, a la 
Smale [Sm 67]). In differential geometry, one puts an additional struc- 
ture on the differentiable manifold (a vector field, a spray, a 2-form, a 
Riemannian metric, ad lib.) and studies properties connected especially 
with these objects. Formally, one may say that one studies properties 
invariant under the group of differentiable automorphisms which preserve 
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the additional structure. In differential equations, one studies vector fields 
and their integral curves, singular points, stable and unstable manifolds, 
etc. A certain number of concepts are essential for all three, and are so 
basic and elementary that it is worthwhile to collect them together so 
that more advanced expositions can be given without having to start 
from the very beginnings. 

It is possible to lay down at no extra cost the foundations (and much 
more beyond) for manifolds modeled on Banach or Hilbert spaces rather 
than finite dimensional spaces. In fact, it turns out that the exposition 
gains considerably from the systematic elimination of the indiscriminate 
use of local coordinates x,, ...,x, and dx,, ..., dx,. These are replaced 
by what they stand for, namely isomorphisms of open subsets of the 
manifold on open subsets of Banach spaces (local charts), and a local 
analysis of the situation which is more powerful and equally easy to use 
formally. In most cases, the finite dimensional proof extends at once 
to an invariant infinite dimensional proof. Furthermore, in studying 
differential forms, one needs to know only the definition of multilinear 
continuous maps. An abuse of multilinear algebra in standard treatises 
arises from an unnecessary double dualization and an abusive use of the 
tensor product. 

I don’t propose, of course, to do away with local coordinates. They 
are useful for computations, and are also especially useful when inte- 
grating differential forms, because the dx, A -:: A dx, corresponds to the 
dx,+::dx, of Lebesgue measure, in oriented charts. Thus we often give 
the local coordinate formulation for such applications. Much of the 
literature is still covered by local coordinates, and I therefore hope that 
the neophyte will thus be helped in getting acquainted with the literature. 
I also hope to convince the expert that nothing is lost, and much is 
gained, by expressing one’s geometric thoughts without hiding them 
under an irrelevant formalism. 

It is profitable to deal with infinite dimensional manifolds, modeled on 
a Banach space in general, a self-dual Banach space for pseudo Rieman- 
nian geometry, and a Hilbert space for Riemannian geometry. In the 
standard pseudo Riemannian and Riemannian theory, readers will note 
that the differential theory works in these infinite dimensional cases, with 
the Hopf—Rinow theorem as the single exception, but not the Cartan— 
Hadamard theorem and its corollaries. Only when one comes to dealing 
with volumes and integration does finite dimensionality play a major 
role. Even if via the physicists with their Feynman integration one even- 
tually develops a coherent analogous theory in the infinite dimensional 
case, there will still be something special about the finite dimensional 
case. 

One major function of finding proofs valid in the infinite dimensional 
case is to provide proofs which are especially natural and simple in the 
finite dimensional case. Even for those who want to deal only with finite 
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dimensional manifolds, I urge them to consider the proofs given in this 
book. In many cases, proofs based on coordinate free local representa- 
tions in charts are clearer than proofs which are replete with the claws of 
a rather unpleasant prying insect such as Tj,,. Indeed, the bilinear map 
associated with a spray (which is the quadratic map corresponding to a 
symmetric connection) satisfies quite a nice local formalism in charts. I 
think the local representation of the curvature tensor as in Proposition 
1.2 of Chapter IX shows the efficiency of this formalism and its superior- 
ity over local coordinates. Readers may also find it instructive to com- 
pare the proof of Proposition 2.6 of Chapter IX concerning the rate of 
growth of Jacobi fields with more classical ones involving coordinates as 
in [He 78], pp. 71-73. 

Of course, there are also direct applications of the infinite dimensional 
case. Some of them are to the calculus of variations and to physics, for 
instance as in Abraham—Marsden [AbM 78]. It may also happen that 
one does not need formally the infinite dimensional setting, but that it is 
useful to keep in mind to motivate the methods and approach taken in 
various directions. For instance, by the device of using curves, one can 
reduce what is a priori an infinite dimensional question to ordinary 
calculus in finite dimensional space, as in the standard variation formulas 
given in Chapter IX, §4. 

Similarly, the proper domain for the geodesic part of Morse theory is 
the loop space (or the space of certain paths), viewed as an infinite 
dimensional manifold, but a substantial part of the theory can be de- 
veloped without formally introducing this manifold. The reduction to the 
finite dimensional case is of course a very interesting aspect of the situa- 
tion, from which one can deduce deep results concerning the finite di- 
mensional manifold itself, but it stops short of a complete analysis of the 
loop space. (Cf. Boot [Bo 60], Milnor [Mi 63].) This was already 
mentioned in the first version of the book, and since then, the papers of 
Palais [Pa 63] and Smale [Sm 64] appeared, carrying out the program. 
They determined the appropriate condition in the infinite dimensional 
case under which this theory works. 

In addition, given two finite dimensional manifolds X, Y it is fruitful 
to give the set of differentiable maps from X to Y an infinite dimensional 
manifold structure, as was started by Eells [Ee 58], [Ee 59], [Ee 61], and 
[Ee 66]. By so doing, one transcends the purely formal translation of 
finite dimensional results getting essentially new ones, which would in 
turn affect the finite dimensional case. 

Foundations for the geometry of manifolds of mappings are given in 
Abraham’s notes of Smale’s lectures [Ab 60] and Palais’s monograph 
[Pa 68]. 

For more recent applications to critical point theory and submanifold 
geometry, see [PaT 88]. 

One especially interesting case of Banach manifolds occurs in the 
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theory of Teichmuller spaces, which, as shown by Bers, can be embedded 
as submanifolds of a complex Banach space. Cf. [Ga 87], [Vi 73]. 

In the direction of differential equations, the extension of the stable 
and unstable manifold theorem to the Banach case, already mentioned as 
a possibility in the earlier version of this book, was proved quite ele- 
gantly by Irwin [Ir 70], following the idea of Pugh and Robbin for 
dealing with local flows using the implicit mapping theorem in Banach 
spaces. I have included the Pugh—Robbin proof, but refer to Irwin’s 
paper for the stable manifold theorem which belongs at the very begin- 
ning of the theory of ordinary differential equations. The Pugh—Robbin 
proof can also be adjusted to hold for vector fields of class H? (Sobolev 
spaces), of importance in partial differential equations, as shown by Ebin 
and Marsden [EbM 70]. 

It is a standard remark that the C%-functions on an open subset of a 
euclidean space do not form a Banach space. They form a Fréchet space 
(denumerably many norms instead of one). On the other hand, the im- 
plicit function theorem and the local existence theorem for differential 
equations are not true in the more general case. In order to recover 
similar results, a much more sophisticated theory is needed, which is only 
beginning to be developed. (Cf. Nash’s paper on Riemannian metrics 
[Na 56], and subsequent contributions of Schwartz [Sc 60] and Moser 
[Mo 61].) In particular, some additional structure must be added 
(smoothing operators). Cf. also my Bourbaki seminar talk on the sub- 
ject [La 61]. This goes beyond the scope of this book, and presents an 
active topic for research. 

I have emphasized differential aspects of differential manifolds rather 
than topological ones. I am especially interested in laying down basic 
material which may lead to various types of applications which have 
arisen since the sixties, vastly expanding the perspective on differential geom- 
etry and analysis. For instance, I expect the marvelous book [BGV 92] 
to be only the first of many to present the accumulated vision from 
the seventies and eighties, after the work of Atiyah, Bismut, Bott, Gilkey, 
McKean, Patodi, Singer, and many others. 


New Haven, 1994 SERGE LANG 


Added Comments, 1995. Immediately after the present book appeared 
in 1995, two other books also appeared which I wish to recommend very 
highly. One of them is the second edition of Gilkey’s book Invariance 
Theory, the Heat Equation, and the Atiyah-Singer Index Theorem (CRC 
Press, 1995). The other is the second edition of Klingenberg’s Riemannian 
Geometry (Walter de Gruyter, 1995), which includes a nice chapter on the 
infinite dimensional Hilbert manifold of H'-mappings, and several sub- 
stantial applications to topology and closed geodesics on various com- 
pact manifolds. 
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CHAPTER | 


Differential Calculus 


We shall recall briefly the notion of derivative and some of its useful 
properties. As mentioned in the foreword, Chapter VIII of Dieudonné’s 
book or my book on real analysis [La 93] give a self-contained and 
complete treatment for Banach spaces. We summarize certain facts con- 
cerning their properties as topological vector spaces, and then we sum- 
marize differential calculus. The reader can actually skip this chapter and 
start immediately with Chapter II if the reader is accustomed to thinking 
about the derivative of a map as a linear transformation. (In the finite 
dimensional case, when bases have been selected, the entries in the matrix 
of this transformation are the partial derivatives of the map.) We have 
repeated the proofs for the more important theorems, for the ease of the 
reader. 

It is convenient to use throughout the language of categories. The 
notion of category and morphism (whose definitions we recall in §1) is 
designed to abstract what is common to certain collections of objects and 
maps between them. For instance, topological vector spaces and continu- 
ous linear maps, open subsets of Banach spaces and differentiable maps, 
differentiable manifolds and differentiable maps, vector bundles and vec- 
tor bundle maps, topological spaces and continuous maps, sets and just 
plain maps. In an arbitrary category, maps are called morphisms, and in 
fact the category of differentiable manifolds is of such importance in this 
book that from Chapter II on, we use the word morphism synonymously 
with differentiable map (or p-times differentiable map, to be precise). All 
other morphisms in other categories will be qualified by a prefix to 
indicate the category to which they belong. 
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I, §1. CATEGORIES 


A category is a collection of objects {X, Y,...} such that for two objects 
X, Y we have a set Mor(X, Y) and for three objects X, Y, Z a mapping 
(composition law) 


Mor(X, Y) x Mor(Y, Z) > Mor(X, Z) 
satisfying the following axioms: 


CAT 1. Two sets Mor(X, Y) and Mor(X’, Y’) are disjoint unless X = 
X’ and Y = Y’, in which case they are equal. 


CAT 2. Each Mor(X, X) has an element idy which acts as a left and 
right identity under the composition law. 


CAT 3. The composition law is associative. 


The elements of Mor(X, Y) are called morphisms, and we write 
frequently f{: XY for such a morphism. The composition of two 
morphisms f, g is written fg or fog. 

A functor 4:2 — W’ from a category WW into a category YW’ is a map 
which associates with each object X in M an object A(X) in W’, and with 
each morphism f: X > Y a morphism /(f): A(X) > A(Y) in W’ such that, 
whenever f and g are morphisms in 2 which can be composed, then 
A( fg) = A(f)A(g) and A(idy) = id,y) for all X. This is in fact a covariant 
functor, and a contravariant functor is defined by reversing the arrows 
(so that we have A(f): A(Y) > A(X) and A(fg) = A(g)ACf)). 

In a similar way, one defines functors of many variables, which may 
be covariant in some variables and contravariant in others. We shall meet 
such functors when we discuss multilinear maps, differential forms, etc. 

The functors of the same variance from one category 2 to another W’ 
form themselves the objects of a category Fun(Q, 2’). Its morphisms 
will sometimes be called natural transformations instead of functor mor- 
phisms. They are defined as follows. If 4, » are two functors from W to 
QW’ (say covariant), then a natural transformation t:A4— py consists of a 
collection of morphisms 

ty: A(X) > w(X) 


as X ranges over &, which makes the following diagram commutative for 
any morphism f: X > Y in QW: 


Ax) —%> U(X) 


Mf ] [ur ) 
A(Y) caress HY) 
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In any category %, we say that a morphism f: X > Y is an isomor- 
phism if there exists a morphism g: YX such that fg and gf are the 
identities. For instance, an isomorphism in the category of topological 
spaces is called a topological isomorphism, or a homeomorphism. In 
general, we describe the category to which an isomorphism belongs by 
means of a suitable prefix. In the category of sets, a set-isomorphism is 
also called a bijection. 

If f:X + Y is a morphism, then a section of f is defined to be a 
morphism g: Y > X such that fog = idy. 


1, §2. TOPOLOGICAL VECTOR SPACES 


The proofs of all statements in this section, including the Hahn—Banach 
theorem and the closed graph theorem, can be found in [La 93]. 

A topological vector space E (over the reals R) is a vector space with a 
topology such that the operations of addition and scalar multiplication 
are continuous. It will be convenient to assume also, as part of the 
definition, that the space is Hausdorff, and locally convex. By this we 
mean that every neighborhood of 0 contains an open neighborhood U of 
0 such that, if x, y are in U and 0 <t <1, then tx + (1 — t)y also lies in 
U. 

The topological vector spaces form a category, denoted by TVS, if we 
let the morphisms be the continuous linear maps (by linear we mean 
throughout R-linear). The set of continuous linear maps of one topologi- 
cal vector space E into F is denoted by L(E,F). The continuous r- 
multilinear maps 
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of E into F will be denoted by L'(E, F). Those which are symmetric 
(resp. alternating) will be denoted by L{(E,F) or Lj,,,(E, F) (resp. 
L‘(E, F)). The isomorphisms in the category TVS are called toplinear 
isomorphisms, and we write Lis(E, F) and Laut(E) for the toplinear iso- 
morphisms of E onto F and the toplinear automorphisms of E. 

We find it convenient to denote by L(E), L’(E), Li(E), and L{(E) the 
continuous linear maps of E into R (resp. the continuous, r-multilinear, 
symmetric, alternating maps of E into R). Following classical terminol- 
ogy, it is also convenient to call such maps into R forms (of the corre- 
sponding type). If E,, ...,E, and F are topological vector spaces, then 
we denote by L(E,,...,E,;F) the continuous multilinear maps of the 
product E, x-:- x E, into F. We let: 


End(E) = L(—, E), 
Laut(E) = elements of End(E) which are invertible in End(E). 
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The most important type of topological vector space for us is the 
Banachable space (a TVS which is complete, and whose topology can be 
defined by a norm). We should say Banach space when we want to put 
the norm into the structure. There are of course many norms which can 
be used to make a Banachable space into a Banach space, but in prac- 
tice, one allows the abuse of language which consists in saying Banach 
space for Banachable space (unless it is absolutely necessary to keep the 
distinction). ; 

For this book, we assume from now on that all our topological vector 
spaces are Banach spaces. We shall occasionally make some comments to 
indicate where it might be possible to generalize certain results to more 
general spaces. We denote our Banach spaces by E, F, .... 

The next two propositions give two aspects of what is known as the 
closed graph theorem.. 


Proposition 2.1. Every continuous bijective linear map of E onto F is a 
toplinear isomorphism. 


Proposition 2.2. If E is a Banach space, and F,, F, are two closed 
subspaces which are complementary (i.e. E = F, + F, and F, 0 F, = 0), 
then the map of F, x F, onto E given by the sum is a toplinear 
isomorphism. 


We shall frequently encounter a situation as in Proposition 2.2, and if 
F is a closed subspace of E such that there exists a closed complement 
F, such that E is toplinearly isomorphic to the product of F and F, 
under the natural mapping, then we shall say that F splits in E. 

Next, we state a weak form of the Hahn—Banach theorem. 


Proposition 2.3. Let E be a Banach space and x #0 an element of E. 
Then there exists a continuous linear map A of E into R such that 
Ax) # 0. 


One constructs 4 by Zorn’s lemma, supposing that 4 is defined on 
some subspace, and having a bounded norm. One then extends 4 to the 
subspace generated by one additional element, without increasing the 
norm. 

In particular, every finite dimensional subspace of E splits if E is 
complete. More trivially, we observe that a finite codimensional closed 
subspace also splits. 

We now come to the problem of putting a topology on L(E, F). Let 
E, F be Banach spaces, and let 


A:E-F 
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be a continuous linear map (also called a bounded linear map). We can 
then define the norm of A to be the greatest lower bound of all numbers 
K such that 

|Ax| S K|x| 


for all x € E. This norm makes L(E, F) into a Banach space. 


In a similar way, we define the topology of L(E,,...,E,; F), which is a 
Banach space if we define the norm of a multilinear continuous map 


A: E, x-++x E,>F 
by the greatest lower bound of all numbers K such that 


|A(x1, aX) < K|x,| S81 X ale 
We have: 


Proposition 2.4. If E,, ...,E,, F are Banach spaces, then the canonical 
map 
L(E,, L(Ey, ...,L(E,, F), ...)) > L'(E,, ...,E,; F) 


from the repeated continuous linear maps to the continuous multilinear 
maps is a toplinear isomorphism, which is norm-preserving, i.e. a Banach- 
isomorphism. 


The preceding propositions could be generalized to a wider class of 
topological vector spaces. The following one exhibits a property peculiar 
to Banach spaces. 


Proposition 2.5. Let E, F be two Banach spaces. Then the set of 
toplinear isomorphisms Lis(E, F) is open in L(E, F). 


The proof is in fact quite simple. If Lis(E, F) is not empty, one is 
immediately reduced to proving that Laut(E) is open in L(E,E). We 
then remark that if ue L(E, E), and |u| < 1, then the series 


lt+utut+-:: 


converges. Given any toplinear automorphism w of E, we can find an 
open neighborhood by translating the open unit ball multiplicatively 
from 1 to w. 

Again in Banach spaces, we have: 


Proposition 2.6. If E, F, G are Banach spaces, then the bilinear maps 


L(E, F) x L(F, G) > L(E, G), 
L(E, F) x E>F, 
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obtained by composition of mappings are continuous, and similarly for 
multilinear maps. 


Remark. The preceding proposition is false for more general spaces 
than Banach spaces, say Fréchet spaces. In that case, one might hope 
that the following may be true. Let U be open in a Fréchet space and 
let 

f: U- LE€, F), 


g: U > Lif, G), 


be continuous. Let y be the composition of maps. Then y(f, g) is contin- 
uous. The same type of question arises later, with differentiable maps 
instead, and it is of course essential to know the answer to deal with the 
composition of differentiable maps. 


Il, §3. DERIVATIVES AND COMPOSITION OF MAPS 


A real valued function of a real variable, defined on some neighborhood 
of 0 is said to be o(t) if 


lim o(t)/t = 0. 
170 


Let E, F be two topological vector spaces, and @ a mapping of a 
neighborhood of 0 in E into F. We say that @ is tangent to 0 if, given a 
neighborhood W of 0 in F, there exists a neighborhood V of 0 in E such 
that 

o(tV) < o(t)W 


for some function o(t). If both E, F are normed, then this amounts to 
the usual condition 


lp(x)] S |x W(x) 


with lim w(x) = 0 as |x| 0. 

Let E, F be two topological vector spaces and U open in E. Let 
f: U—F be a continuous map. We shall say that f is differentiable at a 
point x, éU if there exists a continuous linear map 4 of E into F such 
that, if we let 


f (Xo + y) = f(Xo) + Ay + Oy) 


for small y, then @ is tangent to 0. It then follows trivially that A is 
uniquely determined, and we say that it is the derivative of f at xo. We 
denote the derivative by Df(x,) or f’(xo). It is an element of L(E, F). If 
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f is differentiable at every point of U, then f’ is a map 
f: U-LE€, F). 
It is easy to verify the chain rule. 


Proposition 3.1. If f:U-—-V is differentiable at xo, if g: V7>W is 
differentiable at f(xo), then go f is differentiable at Xo, and 


(9 °F) (Xo) = 9'(F(%0)) © f'(Xo)- 
Proof. We leave it as a simple (and classical) exercise. 


The rest of this section is devoted to the statements of the differential 
calculus. All topological vector spaces are assumed to be Banach spaces 
(i.e. Banachable). Then L(E, F) is also a Banach space, if E and F are 
Banach spaces. 

Let U be open in E and let f: UF be differentiable at each point of 
U. If f’ is continuous, then we say that f is of class C’. We define 
maps of class C? (p = 1) inductively. The p-th derivative D’f is defined as 
D(D?'f) and is itself a map of U into 


L(E, L(®, ...,L(E, F)---)) 


which can be identified with L?(E, F) by Proposition 2.4. A map f is 
said to be of class C? if its kth derivative D*f exists for 1 <k < p, and is 
continuous. 


Remark. Let f be of class C”, on an open set U containing the origin. 
Suppose that f is locally homogeneous of degree p near 0, that is 


S(tx) = t?f (x) 


for all t and x sufficiently small. Then for all sufficiently small x we 
have 


1 
f(x) = pee (0)x™, 
where x?) = (x, x, ...,x), p times. 
This is easily seen by differentiating p times the two expressions for 


f(tx), and then setting t=0. The differentiation is a trivial application of 
the chain rule. 
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Proposition 3.2. Let U, V be open in Banach spaces. If f: U->V and 
g: VF are of class C”, then so is go f. 


From Proposition 3.2, we can view open subsets of Banach spaces as 
the objects of a category, whose morphisms are the continuous maps of 
class C?. These will be called C?-morphisms. We say that f is of class C® 
if it is of class C” for all integers p = 1. From now on, p is an integer 
> 0 or o (C° maps being the continuous maps). In practice, we omit 
the prefix C’ if the p remains fixed. Thus by morphism, throughout the 
rest of this book, we mean C’-morphism with p < oo. We shall use the 
word morphism also for C?-morphisms of manifolds (to be defined in the 
next chapter), but morphisms in any other category will always be prefixed 
so as to indicate the category to which they belong (for instance bundle 
morphism, continuous linear morphism, etc.). 


Proposition 3.3. Let U be open in the Banach space EK, and let 
f:U—-F be a C?-morphism. Then D?f (viewed as an element of 
L?(E, F)) is symmetric. 


Proposition 3.4. Let U be open in E, and let f;; U > F, (i= 1,...,n) be 
continuous maps into spaces F,. Let f =(f,,...,f,) be the map of U 
into the product of the F;. Then f is of class C” if and only if each f, 
is of class C”, and in that case 


Df =(D?f,,...,D°f,). 
Let U, V be open in spaces E,, E, and let 
f:UxV—-F 


be a continuous map into a Banach space. We can introduce the notion 
of partial derivative in the usual manner. If (x, y) is in U x V and we 
keep y fixed, then as a function of the first variable, we have the deriva- 
tive as defined previously. This derivative will be denoted by D, f(x, y). 
Thus 

D,f:U x V>L(E,, F) 


is a map of U x V into L(E,,F). We call it the partial deriative with 
respect to the first variable. Similarly, we have D, f, and we could take n 
factors instead of 2. The total derivative and the partials are then related 
as follows. 


Proposition 3.5. Let U,, ...,U, be open in the spaces E,, ...,E, and let 
f: U, x +++ x U,—-F be a continuous map. Then f is of class C? if and 
only if each partial derivative D,f:U, x -::U,,— L(E;, F) exists and is 
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of class C?~1. If that is the case, then for x =(x,,...,X,) and 


v=(v,,...,0,)€ E, x: x E,, 
we have 


Df(x)-(1, ---5%n) = Y) Dif): 0%. 


The next four propositions are concerned with continuous linear and 
multilinear maps. 


Proposition 3.6. Let E, F be Banach spaces and f:E—F a continuous 
linear map. Then for each x € E we have 


S'(x) =f. 


Proposition 3.7. Let E, F, G be Banach spaces, and U open in E. Let 
f:U—-F be of class C? and g:F >G continuous and linear. Then 
gof is of class C? and 


D'(go f)=go DPf. 
Proposition 3.8. If E,, ...,E, and F are Banach spaces and 
f:E, x: x E,-F 


a continuous multilinear map, then f is ofclass C”, and its (r + 1)-st 
derivative is 0. If r = 2, then Df is computed according to the usual rule 
for derivative of a product (first times the derivative of the second plus 
derivative of the first times the second). 


Proposition 3.9. Let E, F be Banach spaces which are toplinearly iso- 
morphic. If u:E—F is a toplinear isomorphism, we denote its inverse 
by u-!. Then the map 

urul 
from Lis(E, F) to Lis(F, E) is a C®-isomorphism. Its derivative at a 
point ug is the linear map of L(E, F) into L(F, E) given by the formula 


ve uo vuG'. 


Finally, we come to some statements which are of use in the theory of 
vector bundles. 


Proposition 3.10. Let U be open in the Banach space E and let F, G be 
Banach spaces. 


10 DIFFERENTIAL CALCULUS LI, §4] 


(i) If f: U > L(E, F) is a C?-morphism, then the map of U x E into F 
given by 
(x, v) H+ f(x)v 
is a morphism. 


(ii) If f: U + L(E,F) and g: U-L(F, G) are morphisms, then so is 
y(f, g) (y being the composition). 

(iii) If f: U-->R and g: U > L(E, F) are morphisms, so is fg (the value 
of fg at x is f(x)g(x), ordinary multiplication by scalars). 

(iv) If f,; g: U > L(&, F) are morphisms, so is f + g. 


This proposition concludes our summary of results assumed without 
proof. 


|, §4. INTEGRATION AND TAYLOR’S FORMULA 


Let E be a Banach space. Let I denote a real, closed interval, say 
ast<b. A step mapping 
f:I-E 


is a mapping such that there exists a finite number of disjoint sub- 
intervals I,, ...,J, covering I such that on each interval J;, the mapping 
has constant value, say v; We do not require the intervals I; to be 
closed. They may be open, closed, or half-closed. 

Given a sequence of mappings f, from J into E, we say that it 
converges uniformly if, given a neighborhood W of 0 into E, there exists 
an integer nm, such that, for all n, m> np, and all teJ, the difference 
S(t) — f(t) lies in W. The sequence f, then converges to a mapping f of 
T into E. 

A ruled mapping is a uniform limit of step mappings. We leave to the 
reader the proof that every continuous mapping is ruled. 

If f is a step mapping as above, we define its integral 


b b 
| fet f(t) dt = ¥ w(I)v;, 


where p(J;) is the length of the interval J; (its measure in the standard 
Lebesgue measure). This integral is independent of the choice of intervals 
I; on which f is constant. 

If f is ruled and f =lim f, (lim being the uniform limit), then the 


sequence 
b 
Sn 
a 


converges in E to an element of E independent of the particular sequence 
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f, used to approach f uniformly. We denote this limit by 


b b 
| f= | F(t) dt 


and call it the integral of f. The integral is linear in f, and satisfies the 
b 


usual rules concerning changes of intervals. (If b <a then we define 


to be minus the integral from b to a.) 
As an immediate consequence of the definition, we get: 


Proposition 4.1. Let 4:E—R be a continuous linear map and let 
f:I-E be ruled. Then Af = 10 f is ruled, and 


b b 
a{ fiat = | Af (t) dt. 


Proof. If f, is a sequence of step functions converging uniformly to f, 
then Af, is ruled and converges uniformly to 4f. Our formula follows at 
once. 


Taylor’s Formula. Let E, F be Banach spaces. Let U be open in E. 
Let x, y be two points of U such that the segment x + ty lies in U for 
O<t<l. Let 

f:U-F 


be a C?-morphism, and denote by y” the “vector” (y,...,y) p times. 


Then the function D?f(x + ty): y™ is continuous in t, and we have 


— = 4 DP Fey? ™ 


fle + y= fo) + te t$ Ee 


p-1 
; i =F De p(x + ty)y dt. 
o (p—1)! 

Proof. By the Hahn-—Banach theorem, it suffices to show that both 
sides give the same thing when we apply a functional A (continuous 
linear map into R). This follows at once from Proposition 3.7 and 4.1, 
together with the known result when F = R. In this case, the proof pro- 
ceeds by induction on p, and integration by parts, starting from 


f(x +y)-f%) = I, Df(x + ty)y dt. 


The next two corollaries are known as the mean value theorem. 
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Corollary 4.2. Let E, F be two Banach spaces, U open in E, and x, z 
two distinct points of U such that the segment x + t(z — x) (0<t <1) 
lies in U. Let f: U >F be continuous and of class C!. Then 


|f(z) — f(%)| S |z — x| sup |f'()I, 
the sup being taken over & in the segment. 


Proof. This comes from the usual estimations of the integral. Indeed, 
for any continuous map g: I > F we have the estimate 


b 
| g(t) dt 


if K is a bound for g on I, and a<b. This estimate is obvious for step 
functions, and therefore follows at once for continuous functions. 


<= K(b —a) 


Another version of the mean value theorem is frequently used. 


Corollary 4.3. Let the hypotheses be as in Corollary 4.2. Let x9 be a 
point on the segment between x and z. Then 


f(z) — f(x) — f’%o)(z — x)| S$ lz — xl sup |f'(S) — fo), 
the sup taken over all & on the segment. 
Proof. We apply Corollary 4.2 to the map 
g(x) = f(x) — f'(Xo)x. 
Finally, let us make some comments on the estimate of the remainder 


term in Taylor’s formula. We have assumed that D?f is continuous. 
Therefore, D?f(x + ty) can be written 


D?f(x + ty) = D?f(x) + W(y, 0d, 
where w depends on y, t (and x of course), and for fixed x, we have 
lim |/(y, t)| = 0 
as |y| 0. Thus we obtain: 


Corollary 4.4. Let E, F be two Banach spaces, U open in E, and x a 
point of U. Let f: UF be of class C’, p= 1. Then for all y such 
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that the segment x + ty lies in U (0St S 1), we have 


P (p) 
foe + 9) = flo + POPP 4 4 PLEO +) 


with an error term @(y) satisfying 
lim 6(y)/|y|? = 0. 
y>0 


|, §5. THE INVERSE MAPPING THEOREM 


The inverse function theorem and the existence theorem for differential 
equations (of Chapter IV) are based on the next result. 


Lemma 5.1 (Contraction Lemma or Shrinking Lemma). Let M be a 
complete metric space, with distance function d, and let f: M—M bea 
mapping of M into itself. Assume that there is a constant K, 0 < K <1, 
such that, for any two points x, y in M, we have 


d(f(x), f(y) S K d(x, y). 


Then f has a unique fixed point (a point such that f(x) = x). Given any 
point Xo in M, the fixed point is equal to the limit of f"(xo) (iteration of 
f repeated n times) as n tends to infinity. 


Proof. This is a trivial exercise in the convergence of the geometric 
series, which we leave to the reader. 


Theorem 5.2. Let E, F be Banach spaces, U an open subset of E, and 
let f: U>F a C?-morphism with p21. Assume that for some point 
Xo € U, the derivative f'(x9):E—F is a toplinear isomorphism. Then f 
is a local C?-isomorphism at Xo. 


(By a local C?-isomorphism at x), we mean that there exists an open 
neighborhood V of x9 such that the restriction of f to V establishes a 
C?-isomorphism between V and an open subset of E.) 


Proof. Since a toplinear isomorphism is a C®%-isomorphism, we may 
assume without loss of generality that E=F and f’(x,) is the identity 
(simply by considering f’(x.) +o f instead of f). After translations, we 
may also assume that x) = 0 and f(x) = 0. 

We let g(x) = x — f(x). Then g'(x9) = 0 and by continuity there exists 
r > 0 such that, if |x| < 2r, we have 


Ig’) < 2- 
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From the mean value theorem, we see that |g(x)|<4|x| and hence g 
maps the closed ball of radius r, B,(0) into B,2(0). 

We contend: Given y € B,.(0), there exists a unique element x € B.(0) 
such that f(x) = y. We prove this by considering the map 


g(x) =y + x — f(x). 


If |y| Sr/2 and |x| <r, then |g,(x)| Sr and hence g, may be viewed as a 
mapping of the complete metric space B,(0) into itself. The bound of 4 
on the derivative together with the mean value theorem shows that g, is 
a contracting map, i.e. that 


9y(%1) — Gy(X2)1 = 191) — g0%2)1 S 214 — x2 


for x,;, X, €B,(0). By the contraction lemma, it follows that gy has a 
unique fixed point. But the fixed point of g, is precisely the solution of 
the equation f(x) = y. This proves our contention. 

We obtain a local inverse g = f~'. This inverse is continuous, be- 
cause 


1X1 — X2| S| f(%1) — f(%2)| + lg1) — 9&2)! 


Ix; — X2| S2|f(x1) — f(%2)). 


and hence 


Furthermore @ is differentiable in B,,(0). Indeed, let y, = f(x,) and 
Y2 = f(x2) with y,, yz € B,2(0) and x,, x, € B,(0). Then 


1o(y1) — P(¥2) — f'(%2) (41 — Y2)| = |x, — x2 — f' (x2) *(F(x1) — f(x2))I. 


We operate on the expression inside the norm sign with the identity 
id = f'(x2) "f'(%2). 


Estimating and using the continuity of f’, we see that for some constant 
A, the preceding expression is bounded by 


A\f'(x2)(%1 — X2) — f(%1) + f(%2)I. 


From the differentiability of f, we conclude that this expression is 
o(x, — X2) which is also o(y, — yz) in view of the continuity of @ proved 
above. This proves that @ is differentiable and also that its derivative is 
what it should be, namely 


oy) = f'(e)), 


r 66s 


for y€B,(0). Since the mappings @, f’, “inverse” are continuous, it 
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follows that ¢’ is continuous and thus that ¢ is of class C!. Since taking 
inverses is C® and f’ is C?"', it follows inductively that @ is C’, as was 
to be shown. 


Note that this last argument also proves: 


Proposition 5.3. If f: U—V is a homeomorphism and is of class C? 
with p = 1, and if f is a C}-isomorphism, then f is a C?-isomorphism. 


In some applications it is necessary to know that if the derivative of a 
map is close to the identity, then the image of a ball contains a ball of 
only slightly smaller radius. The precise statement follows. In this book, 
it will be used only in the proof of the change of variables formula, and 
therefore may be omitted until the reader needs it. 


Lemma 5.4. Let U be open in E, and let f: UE be of class ci. 
Assume that f(0)=0, f’(0)=I. Let r>0 and assume that B,(0) c U. 
Let 0 < s < 1, and assume that 


If@—-f' Ml ss 


for all x, ze BO). If yeE and |y|<(1—s)r, then there exists a 
unique x € B,(0) such that f(x) = y. 


Proof. The map g, given by g,(x) = x — f(x) + y is defined for |x| Sr 
and |y| < (1 — s)r, and maps B,(0) into itself because, from the estimate 


f(x) — xl = |f) — FO — f’Ox| S |x| sup |f'@) — f'O) S sr, 


we obtain 
\g,(x)| S$ sr +(1 —s)r =r. 


Furthermore, g, is a shrinking map because, from the mean value theo- 
rem, we get 


|gy(X1) — Gy(%2)I = [x1 — X2 — (S01) — F0%2))I 


= |x, — X2 — f’(O)(%1 — X2) + 6(%1, X2)| 


= |6(%1, x2), 
where 


|5(x1, X2)| S |x. — x2 sup |f"(z) — f’O)| S s|x1 — x2). 


Hence g, has a unique fixed point xe B(0) which is such that f(x) = y. 
This proves the lemma. 
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We shall now prove some useful corollaries, which will be used in 
dealing with immersions and submersions later. We assume that mor- 
phism means C?-morphism with p = 1. 


Corollary 5.5. Let U be an open subset of EK, and f: U>F, x F, a 
morphism of U into a product of Banach spaces. Let Xo € U, suppose 
that f(xo) = (0,0) and that f'(xo9) induces a toplinear isomorphism of E 
and F, =F, x 0. Then there exists a local isomorphism g of F, x F, 
at (0, 0) such that 

gof:U>F, x EK, 


maps an open subset U, of U into F, x 0 and induces a local isomor- 
phism of U, at Xo on an open neighborhood of 0 in F,. 


Proof. We may assume without loss of generality that F, = E (iden- 
tify by means of f’(x )) and x) = 0. We define 


g:UxF,->F, x F, 
by the formula 
P(x, Ya) = f(x) + 0, y2) 


for xe U and y,e€F,. Then ¢(x, 0) = f(x), and 
g'(0, 0) = f’(0) + ©, id,). 


Since f’(0) is assumed to be a toplinear isomorphism onto F, x 0, it 
follows that @’(0, 0) is also a toplinear isomorphism. Hence by the theo- 
rem, it has a local inverse, say g, which obviously satisfies our require- 
ments. 


Corollary 5.6. Let E, F be Banach spaces, U open in EK, and f: U>F 
a C?-morphism with p=1. Let xy9¢U. Suppose that f(x9)=0 and 
f'(X%o) gives a toplinear isomorphism of E on a closed subspace of F 
which splits. Then there exists a local isomorphism g: F > F, x F, at 0 
and an open subset U, of U containing x9 such that the composite map 
g° f induces an isomorphism of U, onto an open subset of F,. 


Considering the splitting assumption, this is a reformulation of Corol- 
lary 5.5. 


It is convenient to define the notion of splitting for injections. If E, F 
are topological vector spaces, and 14: E-—F is a continuous linear map, 
which is injective, then we shall say that / splits if there exists a toplinear 
isomorphism «: F > F, x F, such that «od induces a toplinear isomor- 
phism of E onto F, = F, x 0. In our corollary, we could have rephrased 
our assumption by saying that f’(x9) is a splitting injection. 
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For the next corollary, dual to the preceding one, we introduce the 
notion of a local projection. Given a product of two open sets of Banach 
spaces V, x V, and a morphism f: V, x V, >F, we say that f is a pro- 
jection (on the first factor) if f can be factored 


Ye xe 


into an ordinary projection and an isomorphism of V, onto an open 
subset of F. We say that f is a local projection at (a,, a.) if there exists 
an open neighborhood U, x U, of (a,,a,) such that the restriction of f 
to this neighborhood is a projection. 


Corollary 5.7. Let U be an open subset of a product of Banach spaces 
E, x E, and (a,,a,) a point of U. Let f: U +F be a morphism into a 
Banach space, say f(a,,@,) = 0, and assume that the partial derivative 


D, f(a,, a2): E, > F 


is a toplinear isomorphism. Then there exists a local isomorphism h of a 
product V, x V, onto an open neighborhood of (a,,a,) contained in U 
such that the composite map 


f 


h 
V, x , —> U— F 
is a projection (on the second factor). 
Proof. We may assume (a,, a.) = (0, 0) and E, = F. We define 


go: E, x E,->E, x E, 
by 
(X14, X2) = (x, f, x2) 


locally at (a,,a,). Then g’ is represented by the matrix 


O ) 

Dif Daf 

and is therefore a toplinear isomorphism at (a,,a,). By the theorem, it 
has a local inverse h which clearly satisfies our requirements. 


Corollary 5.8. Let U be an open subset of a Banach space E and 
f: UF a morphism into a Banach space F. Let x)5¢U and assume 
that f'(xo) is surjective, and that its kernel splits. Then there exists an 
open subset U’ of U containing Xo and an isomorphism 


hiV, x VWy>U' 
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such that the composite map f oh is a projection 
Vx hoKy-F. 


Proof. Again this is essentially a reformulation of the corollary, taking 
into account the splitting assumption. 


Theorem 5.9 (The Implicit Mapping Theorem). Let U, V be open sets 
in Banach spaces E, F respectively, and let 


f:UxV3G 
be a C” mapping. Let (a, b)e U x V, and assume that 
D, f(a, b): F>G 
is a toplinear isomorphism. Let f(a, b) = 0. Then there exists a continu- 


ous map g: Uy > V defined on an open neighborhood Uy of a such that 
g(a) = b and such that 


f(x, g(x) = 0 


for all x € Uy. If Uy is taken to be a sufficiently small ball, then g is 
uniquely determined, and is also of class C?. 


Proof. Let A4=D,f(a,b). Replacing f by 24-'o f we may assume 
without loss of generality that D, f(a, b) is the identity. Consider the map 


@:UxV7>ExF 
given by 
o(x, y) = (x, f(x, y)). 


Then the derivative of @ at (a, b) is immediately computed to be repre- 
sented by the matrix 


Peel Ge b) Dafa ») ~ ee b) «:) 


whence ¢ is locally invertible at (a, b) since the inverse of De(a, b) exists 


and is the matrix 
idy O 
—D, f(a,b) idp) 
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We denote the local inverse of g by yw. We can write 


W(x, 2) = (x, h(x, z)) 


where h is some mapping of class C?. We define 
g(x) = h(x, 0). 


Then certainly g is of class C? and 


(x, f(x, g(x))) = (x, g(x)) = (x, h(x, 0)) = plW(x, 0)) = (x, 0). 


This proves the existence of a C” map g satisfying our requirements. 

Now for the uniqueness, suppose that gy is a continuous map defined 
near a such that go(a)=b and f(x, go(x))=c for all x near a. Then 
Jo(x) is near b for such x, and hence 


(x, Jo(x)) = (x, 0). 


Since @ is invertible near (a, b) it follows that there is a unique point 
(x, y) near (a,b) such that (x, y) =(x,0). Let Uy be a small ball on 
which g is defined. If go is also defined on Up, then the above argument 
shows that g and gy coincide on some smaller neighborhood of a. Let 
xeéU, and let v=x-—a. Consider the set of those numbers ¢ with 
0<t<1 such that g(a + tv) = go(a + tv). This set is not empty. Let s 
be its least upper bound. By continuity, we have g(a + sv) = go(a + sv). 
If s < 1, we can apply the existence and that part of the uniqueness just 
proved to show that g and go are in fact equal in a neighborhood of 
a+ sv. Hence s=1, and our uniqueness statement is proved, as well as 
the theorem. 


Note. The particular value f(a,b)=0 in the preceding theorem is 
irrelevant. If f(a,b)=c for some c #0, then the above proof goes 
through replacing 0 by c everywhere. 


CHAPTER Il 


Manifolds 


Starting with open subsets of Banach spaces, one can glue them together 
with C?-isomorphisms. The result is called a manifold. We begin by 
giving the formal definition. We then make manifolds into a category, 
and discuss special types of morphisms. We define the tangent space at 
each point, and apply the criteria following the inverse function theorem 
to get a local splitting of a manifold when the tangent space splits at a 
point. 

We shall wait until the next chapter to give a manifold structure to 
the union of all the tangent spaces. 
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Let X be a set. An atlas of class C? (p=>0) on X is a collection of 
pairs (U,,9,) (i ranging in some indexing set), satisfying the following 
conditions: 

AT 1. Each U;, is a subset of X and the U, cover X. 


AT 2. Each 9, is a bijection of U; onto an open subset ,U, of some 
Banach space E; and for any i, j, @(U;- U,) is open in E,. 


AT 3. The map 
9; *: @(U; 0 U,) > @(U;, 0 U)) 


is a C?-isomorphism for each pair of indices i, j. 


It is a trivial exercise in point set topology to prove that one can give 
X a topology in a unique way such that each U, is open, and the g, 
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are topological isomorphisms. We see no reason to assume that X is 
Hausdorff. If we wanted X to be Hausdorff, we would have to place a 
separation condition on the covering. This plays no role in the formal 
development in Chapters II and III. It is to be understood, however, 
that any construction which we perform (like products, tangent bundles, 
etc.) would yield Hausdorff spaces if we start with Hausdorff spaces. 

Each pair (U;, g;) will be called a chart of the atlas. If a point x of X 
lies in U,, then we say that (U,, @;) is a chart at x. 

In condition AT 2, we did not require that the vector spaces be the 
same for all indices i, or even that they be toplinearly isomorphic. If 
they are all equal to the same space E, then we say that the atlas is an 
E-atlas. If two charts (U;, g;) and (U;, g;) are such that U; and U; have a 
non-empty intersection, and if p = 1, then taking the derivative of 99; * 
we see that E; and E; are toplinearly isomorphic. Furthermore, the set of 
points x ¢ X for which there exists a chart (U;,@,) at x such that E, is 
toplinearly isomorphic to a given space E is both open and closed. 
Consequently, on each connected component of X, we could assume that 
we have an E-atlas for some fixed E. 

Suppose that we are given an open subset U of X and a topological 
isomorphism g~: UU’ onto an open subset of some Banach space E. 
We shall say that (U,@) is compatible with the atlas {(U,,@,)} if each 
map 9,9 ' (defined on a suitable intersection as in AT 3) is a C?- 
isomorphism. Two atlases are said to be compatible if each chart of one 
is compatible with the other atlas. One verifies immediately that the 
relation of compatibility between atlases is an equivalence relation. An 
equivalence class of atlases of class C? on X is said to define a structure 
of C?-manifold on X. If all the vector spaces E; in some atlas are 
toplinearly isomorphic, then we can always find an equivalent atlas for 
which they are all equal, say to the vector space E. We then say that X 
is an E-manifold or that X is modeled on E. 

If E=R" for some fixed n, then we say that the manifold is n- 
dimensional. In this case, a chart 


og: UR" 


is given by n coordinate functions 9, ...,9,. If P denotes a point of U, 
these functions are often written 


X,(P), ...,X,(P), 


or simply x,,...,x,. They are called local coordinates on the manifold. 

If the integer p (which may also be oo) is fixed throughout a discus- 
sion, we also say that X is a manifold. 

The collection of C?’-manifolds will be denoted by Man’. If we 
look only at those modeled on spaces in a category Y% then we write 
Man?(2). Those modeled on a fixed E will be denoted by Man?(E). We 
shall make these into categories by defining morphisms below. 
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Let X be a manifold, and U an open subset of X. Then it is possible, 
in the obvious way, to induce a manifold structure on U, by taking as 
charts the intersections 


(U0 U, g\(U;O U)). 


If X is a topological space, covered by open subsets V;, and if we are 
given on each V, a manifold structure such that for each pair j, j’ the 
induced structure on V,- V, coincides, then it is clear that we can give to 
X a unique manifold structure inducing the given ones on each J,. 


Example. Let X be the real line, and for each open interval U,, let 9; 
be the function g(t)= 1°. Then the 9,g;' are all equal to the identity, 
and thus we have defined a C”-manifold structure on R! 


If X, Y are two manifolds, then one can give the product X x Y a 
manifold structure in the obvious way. If {(U;,@,)} and {(V, y,)} are 
atlases for X, Y respectively, then 


{(U; x Vj, Pi X W)} 


is an atlas for the product, and the product of compatible atlases gives 
rise to compatible atlases, so that we do get a well-defined product 
structure. 

Let X, Y be two manifolds. Let f: X +> Y be a map. We shall say 
that f is a C?-morphism if, given x € X, there exists a chart (U, ) at x 
and a chart (V, w) at f(x) such that f(U) < V, and the map 


wofogt: QU >W 


is a C’-morphism in the sense of Chapter I, §3. One sees then immedi- 
ately that this same condition holds for any choice of charts (U, @) at x 
and (V, W) at f(x) such that f(U) c V. 

It is clear that the composite of two C?-morphisms is itself a C?- 
morphism (because it is true for open subsets of vector spaces). The 
C?-manifolds and C?’-morphisms form a category. The notion of iso- 
morphism is therefore defined, and we observe that in our example of 
the real line, the map t++?t? gives an isomorphism between the funny 
differentiable structure and the usual one. 

If f: X > Y is a morphism, and (U, @) is a chart at a point xe X, 
while (V, ) is a chart at f(x), then we shall also denote by 


fy,ui' QU > WV 


the map wfg"!. 
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It is also convenient to have a local terminology. Let U be an open 
set (of a manifold or a Banach space) containing a point x). By a local 
isomorphism at x) we mean an isomorphism 


f:U,;7V 


from some open set U, containing x (and contained in U) to an open 
set V (in some manifold or some Banach space). Thus a local isomor- 
phism is essentially a change of chart, locally near a given point. 


Manifolds of maps. Even starting with a finite dimensional manifold, 
the set of maps satisfying various smoothness conditions forms an infinite 
dimensional manifold. This story started with Eells [Ee 58], [Ee 59], 
[Ee 61]. Palais and Smale used such manifolds of maps in their Morse 
theory [Pa 63], [Ab 62], [Sm 64]. For a brief discussion of subsequent 
developments, see [Mar 74], p. 67, referring to [Eb 70], [Ee 66], 
[El 67], [Kr 72], [Le 67], [Om 70], and [Pa 68]. Two kinds of maps 
have played a role: the C” maps of course, with various values of p, but 
also maps satisfying Sobolev conditions, and usually denoted by H*. The 
latter form Hilbert manifolds (definition to be given later). 
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Let X be a topological space, and Y a subset of X. We say that Y is 
locally closed in X if every point ye Y has an open neighborhood U in 
X such that YOU is closed in U. One verifies easily that a locally 
closed subset is the intersection of an open set and a closed set. For 
instance, any open subset of X is locally closed, and any open interval is 
locally closed in the plane. 

Let X be a manifold (of class C? with p = 0). Let Y be a subset of X 
and assume that for each point ye Y there exists a chart (V, W) at y such 
that w gives an isomorphism of V with a product V, x V, where V, is 
open in some space E, and V, is open in some space E,, and such that 


WY AV) =\ x a 


for some point a, eV, (which we could take to be 0). Then it is clear 
that Y is locally closed in X. Furthermore, the map wW induces a 
bijection 

Wi YaVoy,. 


The collection of pairs (Y AV, W,) obtained in the above manner constitutes 
an atlas for Y, of class C?. The verification of this assertion, whose 
formal details we leave to the reader, depends on the following obvious 
fact. 
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Lemma 2.1. Let U,, U,, Vi, Vz be open subsets of Banach spaces, and 
g: U, x U, > V, x V, a C?-morphism. Let a, U, and b,€ V, and as- 
sume that g maps U, x a, into V, x b,. Then the induced map 


g::U,>V, 
is also a morphism. 


Indeed, it is obtained as a composite map 
U, + U, x U, > VY x V,>%,, 


the first map being an inclusion and the third a projection. 

We have therefore defined a C?-structure on Y which will be called a 
submanifold of X. This structure satisfies a universal mapping property, 
which characterizes it, namely: 


Given any map f:Z-— xX from a manifold Z into X such that f(Z) is 
contained in Y. Let fy:Z—Y be the induced map. Then f is a mor- 
phism if and only if fy is a morphism. 


The proof of this assertion depends on Lemma 2.1, and is trivial. 


Finally, we note that the inclusion of Y into X is a morphism. 
If Y is also a closed subspace of X, then we say that it is a closed 
submanifold. 


Suppose that X is finite dimensional of dimension n, and that Y is a 
submanifold of dimension r. Then from the definition we see that the 
local product structure in a neighborhood of a point of Y can be ex- 
pressed in terms of local coordinates as follows. Each point P of Y has 
an open neighborhood U in X with local coordinates (x,,...,x,) such 
that the points of Y in U are precisely those whose last n — r coordinates 
are 0, that is, those points having coordinates of type 


(x4, ...5%,, 0, ...,0). 


Let f: Z— X be a morphism, and let ze Z. We shall say that f is an 
immersion at z if there exists an open neighborhood Z, of z in Z such 
that the restriction of f to Z, induces an isomorphism of Z, onto a 
submanifold of X. We say that f is an immersion if it is an immersion at 
every point. 


Note that there exist injective immersions which are not isomorphisms 
onto submanifolds, as given by the following example: 
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(The arrow means that the line approaches itself without touching.) An 
immersion which does give an isomorphism onto a submanifold is called 
an embedding, and it is called a closed embedding if this submanifold is 
closed. 

A morphism f: X > Y will be called a submersion at a point xe X if 
there exists a chart (U, m) at x and a chart (V,W) at f(x) such that 
gives an isomorphism of U on a products U, x U, (U, and U, open in 
some Banach spaces), and such that the map 


vfo™ = fy,y: U, x U,>V 


is a projection. One sees then that the image of a submersion is an open 
subset (a submersion is in fact an open mapping). We say that f is a 
submersion if it is a submersion at every point. 

For manifolds modelled on Banach spaces, we have the usual criterion 
for immersions and submersions in terms of the derivative. 


Proposition 2.2. Let X, Y be manifolds of class C? (p = 1) modeled on 
Banach spaces. Let f: X > Y be a C?-morphism. Let x € X. Then: 


(i) f is an immersion at x if and only if there exists a chart (U, @) at 
x and (V, W) at f(x) such that fy y(@x) is injective and splits. 

(ii) f is a submersion at x if and only if there exists a chart (U, @) at 
x and (V,w) at f(x) such that fy, y(px) is surjective and its kernel 
splits. 


Proof. This is an immediate consequence of Corollaries 5.4 and 5.6 of 
the inverse mapping theorem. 


The conditions expressed in (i) and (ii) depend only on the derivative, 
and if they hold for one choice of charts (U, ~) and (V, w) respectively, 
then they hold for every choice of such charts. It is therefore convenient 
to introduce a terminology in order to deal with such properties. 

Let X be a manifold of class C? (p= 1). Let x be a point of X. We 
consider triples (U, ~, v) where (U, @) is a chart at x and v is an element 
of the vector space in which @U lies. We say that two such triples 
(U, 9, v) and (V, w, w) are equivalent if the derivative of yo! at @x maps 
v on w. The formula reads: 


(Woe) (px)v = w 
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(obviously an equivalence relation by the chain rule). An equivalence 
class of such triples is called a tangent vector of X at x. The set of such 
tangent vectors is called the tangent space of X at x and is denoted by 
T,(X). Each chart (U, @) determines a bijection of T,(X) on a Banach 
space, namely the equivalence class of (U, ~, v) corresponds to the vector 
v. By means of such a bijection it is possible to transport to T,(X) the 
structure of topological vector space given by the chart, and it is immedi- 
ate that this structure is independent of the chart selected. 

If U, V are open in Banach spaces, then to every morphism of class 
C? (p= 1) we can associate its derivative Df(x). If now f:X > Y is a 
morphism of one manifold into another, and x a point of X, then by 
means of charts we can interpret the derivative of f on each chart at x 
as a mapping 


df(x) = T.f: TAX) > Tr). 


Indeed, this map T,f is the unique linear map having the following 
property. If (U, @) is a chart at x and (V,W) is a chart at f(x) such that 
f(U) < V and 0 is a tangent vector at x represented by v in the chart 
(U, @), then 

Tf (6) 


is the tangent vector at f(x) represented by Df, ,(x)v. The representation 
of T, f on the spaces of charts can be given in the form of a diagram 


T(x) —— E 
Tf [ft 
Tray(Y) —— F 
The map T,f is obviously continuous and linear for the structure of 
topological vector space which we have placed on T,(X) and Tj,,,(Y). 
As a matter of notation, we shall sometimes write f, , instead of Tf. 


The operation T satisfies an obvious functorial property, namely, if 
f: X > Y and g: Y > Z are morphisms, then 


Tg ° f) = Trg) ° TS), 
T,(id) = id. 


We may reformulate Proposition 2.2: 


Proposition 2.3. Let X, Y be manifolds of class C” (p 2 1) modelled on 
Banach spaces. Let f: X + Y be a C?-morphism. Let x € X. Then: 


(i) f is an immersion at x if and only if the map Tf is injective and 
splits. 

(ii) f is a submersion at x if and only if the map T,f is surjective and 
its kernel splits. 
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Note. If X, Y are finite dimensional, then the condition that T,f splits 
is superfluous. Every subspace of a finite dimensional vector space splits. 


Example. Let E be a (real) Hilbert space, and let <x, y> eR be its 
inner product. Then the square of the norm f(x) = <x, x> is obviously 
of class C®. The derivative f’(x) is given by the formula 


f'(x)y = 20% y> 


and for any given x #0, it follows that the derivative f’(x) is surjective. 
Furthermore, its kernel is the orthogonal complement of the subspace 
generated by x, and hence splits. Consequently the unit sphere in Hilbert 
space is a submanifold. 


If W is a submanifold of a manifold Y of class C? (p= 1), then the 
inclusion 
i: WY 
induces a map 
Ti: T,(W) > T,(Y) 


which is in fact an injection. From the definition of a submanifold, one 
sees immediately that the image of T,i splits. It will be convenient to 
identify T,(W) in T,,(Y) if no confusion can result. 

A morphism f: X — Y will be said to be transversal over the submani- 
fold W of Y if the following condition is satisfied. 

Let x eX be such that f(x)eW. Let (V,W) be a chart at f(x) such 
that W: VV, x V, is an isomorphism on a product, with 


W(f(X))=(0,0) and W(WoV)=V, x0. 


Then there exists an open neighborhood U of x such that the composite 


map 
ee ue ey eG 


is a submersion. 
In particular, if f is transversal over W, then f~'(W) is a submanifold 
of X, because the inverse image of 0 by our local composite map 


propof 


is equal to the inverse image of WO V by yw. 
As with immersions and submersions, we have a characterization of 
transversal maps in terms of tangent spaces. 


Proposition 2.4. Let X, Y be manifolds of class C? (p = 1) modeled on 
Banach spaces. Let f: X + Y be a C?-morphism, and W a submanifold 
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of Y. The map f is transversal over W if and only if for each xe X 
such that f(x) lies in W, the composite map 


T, 
T(X) = Tyl¥) > Ty(Y)/TolW) 
with w = f(x) is surjective and its kernel splits. 


Proof. If f is transversal over W, then for each point x € X such that 
f(x) lies in W, we choose charts as in the definition, and reduce the 
question to one of maps of open subsets of Banach spaces. In that case, 
the conclusion concerning the tangent spaces follows at once from the 
assumed direct product decompositions. Conversely, assume our condi- 
tion on the tangent map. The question being local, we can assume that 
Y = V, x V, is a product of open sets in Banach spaces such that W = 
V, x 0, and we can also assume that X =U is open in some Banach 
space, x =0. Then we let g: UV, be the map zof where z is the 
projection, and note that our assumption means that g’(0) is surjective 
and its kernel splits. Furthermore, g~'(0) = f~'(W). We can then use 
Corollary 5.7 of the inverse mapping theorem to conclude the proof. 


Remark. In the statement of our proposition, we observe that the 
surjectivity of the composite map is equivalent to the fact that T,(Y) is 
equal to the sum of the image of T, f and T,(W), that is 


T,(Y) = Im(T,f) + Im(T,i), 


where i: W— Y is the inclusion. In the finite dimensional case, the other 
condition is therefore redundant. 


If E is a Banach space, then the diagonal A in E x E is a closed 
subspace and splits: Either factor E x 0 or 0 x E is a closed complement. 
Consequently, the diagonal is a closed submanifold of E x E. If X is 
any manifold of class C?, p= 1, then the diagonal is therefore also a 
submanifold. (It is closed of course if and only if X is Hausdorff.) 

Let f: X > Z and g: YZ be two C?-morphisms, p 2 1. We say that 
they are transversal if the morphism 


f[xg:xX x YoZxZ 


is transversal over the diagonal. We remark right away that the sur- 
jectivity of the map in Proposition 2.4 can be expressed in two ways. 
Given two points xe X and yeY such that f(x)=g(y) =z, the 
condition 


Im(7.f) + Im(T,g) = F(Z) 
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is equivalent to the condition 


Im(T,.,)(f x g)) + Tz, 2(A) = Tz,2(Z x Z). 


Thus in the finite dimensional case, we could take it as definition of 
transversality. 

We use transversality as a sufficient condition under which the fiber 
product of two morphisms exists. We recall that in any category, the 
fiber product of two morphisms f: X > Z and g: YZ over Z consists 
of an object P and two morphisms 


g;: P73 X and g2:P > Y 


such that fog, =gogp,, and satisfying the universal mapping property: 
Given an object S and two morphisms u,:S > X and u,:S— Y such that 
fu, = gu, there exists a unique morphism u: S > P making the following 


diagram commutative: 


».¢ Y 


% 


The triple (P, g;, 92) is uniquely determined, up to a unique isomorphism 
(in the obvious sense), and P is also denoted by X xz, Y. 

One can view the fiber product unsymmetrically. Given two mor- 
phisms f, g as in the following diagram: 


Y 
| 
Xx —Z 
f 
assume that their fiber product exists, so that we can fill in the diagram: 
X xz,Y —— Y 


| | 


x ——Z 
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We say that g, is the pull back of g by f, and also write it as f*(g). 
Similarly, we write X xz Y as f*(Y). 

In our category of manifolds, we shall deal only with cases when the 
fiber product can be taken to be the set-theoretic fiber product on which 
a manifold structure has been defined. (The set-theoretic fiber product is 
the set of pairs of points projecting on the same point.) This determines 
the fiber product uniquely, and not only up to a unique isomorphism. 


Proposition 2.5. Let f: X > Z and g: YZ be two C?-morphisms with 
p= 1. If they are transversal, then 


(f x g)*(Az), 


together with the natural morphisms into X and Y (obtained from the 
projections), is a fiber product of f and g over Z. 


Proof. Obvious. 


To construct a fiber product, it suffices to do it locally. Indeed, let 
f: X >Z and g: YZ be two morphisms. Let {V,} be an open covering 
of Z, and let 


Lif Wow and go: g Wy 


be the restrictions of f and g to the respective inverse images of V;. Let 
P=(f x g)\(Az). Then P consists of the points (x, y) with xe X and 
yeY such that f(x) = g(y). We view P as a subspace of X x Y (ie. with 
the topology induced by that of X x Y). Similarly, we construct P, with 
jf, and g;. Then P is open in P. The projections on the first and second 
factors give natural maps of P, into f-'(V) and g™+(V,), and of P into X 
and Y. 


Proposition 2.6. Assume that each P, admits a manifold structure (com- 
patible with its topology) such that these maps are morphisms, making P, 
into a fiber product of f, and g;. Then P, with its natural projections, is 
a fiber product of f and g. 


To prove the above assertion, we observe that the P, form a covering 
of P. Furthermore, the manifold structure on P.O P; induced by that of 
P, or P; must be the same, because it is the unique fiber product struc- 
ture over YO V;, for the maps f, and g,, (defined on f\(Vin Y) and 
g ‘(Vic V) respectively). Thus we can give P a manifold structure, in 
such a way that the two projections into X and Y are morphisms, and 
make P into a fiber product of f and g. 
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We shall apply the preceding discussion to vector bundles in the 
next chapter, and the following local criterion will be useful. 


Proposition 2.7. Let f: X > Z be a morphism, and g: Z x WZ be the 
projection on the first factor. Then f, g have a fiber product, namely 
the product X x W together with the morphisms of the following 
diagram: 


id 
yuw 225 cew 


Pry; | [x 


xX —— Z 
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Let X be a manifold of class C?. A function on X will be a morphism of 
X into R, of class C’, unless otherwise specified. The C? functions form 
a ring %(X). The support of a function f is the closure of the set of 
points x such that f(x) 40. 

Let X be a topological space. A covering of X is locally finite if every 
point has a neighborhood which intersects only finitely many elements of 
the covering. A refinement of a covering of X is a second covering, each 
element of which is contained in an element of the first covering. A 
topological space is paracompact if it is Hausdorff, and every open cover- 
ing has a locally finite open refinement. 


Proposition 3.1. If X is a paracompact space, and if {U;} is an open 
covering, then there exists a locally finite open covering {V,} such that 
V, < U; for each i. 


Proof. Let {V,} be a locally finite open refinement of {U,}. For each k 
there is an index i(k) such that } c Uy. We let W, be the union of 
those KW such that i(k)=i. Then the W, form a locally finite open 
covering, because any neighborhood of a point which meets infinitely 
many W, must also meet infinitely many ),. 


Proposition 3.2. If X is paracompact, then X is normal. If, further- 
more, {U,} is a locally finite open covering of X, then there exists a 
locally finite open covering {V,} such that V,< U,. 

Proof. We refer the reader to Bourbaki [Bou 68]. 


Observe that Proposition 3.1 shows that the insistence that the index- 
ing set of a refinement be a given one can easily be achieved. 
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A partition of unity (of class C?) on a manifold X consists of an open 
covering {U;} of X and a family of functions 


WX OR 
satisfying the following conditions: 


PU 1. For all x eX we have w;(x) 2 0. 
PU 2. The support of W; is contained in U,. 
PU 3. The covering is locally finite. 

PU 4. For each point x € X we have 


y W;(x) = 1. 


(The sum is taken over all i, but is in fact finite for any given point x in 
view of PU 3.) 

We sometimes say that {(U,, W,)} is a partition of unity. 

A manifold X will be said to admit partitions of unity if it is para- 
compact, and if, given a locally finite open covering {U;}, there exists a 
partition of unity {y,} such that the support of w; is contained in U,. 

If {U;} is a covering of X, then we say that a covering {Y} is subordi- 
nated to {U,} if each \, is contained in some U,. 

It is desirable to give sufficient conditions on a manifold in order to 
insure the existence of partitions of unity. There is no difficulty with the 
topological aspects of this problem. It is known that a metric space is 
paracompact (cf. Bourbaki [Bou 68], [Ke 55]), and on a paracompact 
space, one knows how to construct continuous partitions of unity (loc. 
cit.). However, in the case of infinite dimensional manifolds, certain diffi- 
culties arise to construct differentiable ones, and it is known that a 
Banach space itself may not admit partitions of unity (say of class C%). 
The construction of differentiable partitions of unity depends on the 
construction of a differentiable norm. Readers will find examples, theo- 
rems, and counterexamples in [BoF 65], [BoF 66], and [Re 64]. In the 
finite dimensional case, the existence will follow from the next theorem. 


If E is a Banach space, we denote by B,(a) the open ball of radius r 
and center a, and by B,(a) the closed ball of radius r and center a. If 
a=0, then we write B, and B, respectively. Two open balls (of finite 
radius) are obviously C®-isomorphic. If X is a manifold and (V, @) is a 
chart at a point x e X, then we say that (V, ~) (or simply V) is a ball of 
radius r if @V is a ball of radius r in the Banach space. 


Theorem 3.3. Let X be a manifold which is locally compact, Hausdorff, 
and whose topology has a countable base. Given an open covering of X, 
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then there exists an atlas {(V,, p,)} such that the covering {V,} is locally 
finite and subordinated to the given covering, such that 9,V, is the open 
ball B;,, and such that the open sets W, = o, '(B,) cover X. 


Proof. Let U,, U,, ... be a basis for the open sets of X such that 
each U, is compact. We construct inductively a sequence A,, Ap, ... of 
compact sets whose union is X, such that A; is contained in the interior 
of A,;,,. We let A, = U,. Suppose we have constructed A;. We let j be 
the smallest integer such that A; is contained in U,U-:-UU;. We let 
A;4, be the closed and compact set 


UO UU, UU: 


For each point x € X we can find an arbitrarily small chart (V,, @,) at 
x such that o,V, is the ball of radius 3 (so that each V, is contained in 
some element of U). We let W, = o,'(B,) be the ball of radius 1 in this 
chart. We can cover the set 


Aj,, — Int(A;) 


(intuitively the closed annulus) by a finite number of these balls of radius 
1, say W,, ...,.W,, such that, at the same time, each one of VY, ...,V, is 
contained in the open set Int(A,;,,) — A;-, (intuitively, the open annulus 
of the next bigger size). We let 8; denote the collection V,, ...,V, and let 
% be composed of the union of the 8;. Then 8% is locally finite, and we 
are done. 


Corollary 3.4. Let X be a manifold which is locally compact Hausdorff, 
and whose topology has a countable base. Then X admits partitions of 
unity. 


Proof. Let {(V;,, @,)} be as in the theorem, and W, = 9, '(B,). We can 
find a function w, of class C? such that 0< yw, <1, such that y,(x) = 1 
for xe W, and y,(x)=0 for x¢ y,. (The proof is recalled below.) We 


now let 
W = Ds We 


(a sum which is finite at each point), and we let »,=w,/W. Then 
{(Y, %)} is the desired partition of unity. 


We now recall the argument giving the function y,. First, given two 
real numbers r, s with 0 <r < ss, the function defined by 


= 
aAP (, Hee 5) 
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in the open interval r<t<s and 0 outside the interval determines a 
bell-shaped C%-function from R into R. Its integral from minus infinity 
to t, divided by the area under the bell yields a function which lies 
strictly between 0 and 1 in the interval r<t<s, is equal to 0 fort Sr 
and is equal to 1 for t =s. (The function is even monotone increasing.) 

We can therefore find a real valued function of a real variable, say 
n(t), such that y(t) = 1 for |t} <1 and n(t)=0 for |t| 21+ 6 with small 
6, and such that 0O<7 <1. If E is a Hilbert space, then n(|x|?) = (x) 
gives us a function which is equal to 1 on the ball of radius 1 and 0 
outside the ball of radius 1 + 6. This function can then be transported to 
the manifold by any given chart whose image is the ball of radius 3. 

In a similar way, one would construct a function which is > 0 on a 
given ball and = 0 outside this ball. 

Partitions of unity constitute the only known means of gluing together 
local mappings (into objects having an addition, namely vector bundles, 
discussed in the next chapter). It is therefore important, in both the 
Banach and Hilbert cases, to determine conditions under which they 
exist. In the Banach case, there is the added difficulty that the argument 
just given to get a local function which is 1 on B, and 0 outside B, fails 
if one cannot find a differentiable function of the norm, or of an equiva- 
lent norm used to define the Banachable structure. 

Even though it is not known whether Theorem 3.3 extends to Hilbert 
manifolds, it is still possible to construct partitions of unity in that case. 
As Eells pointed out to me, Dieudonné’s method of proof showing that 
separable metric space is paracompact can be applied for that purpose 
‘(this is Lemma 3.5 below), and I am indebted to him for the following 
exposition. 

We need some lemmas. We use the notation ‘A for the complement of 
a set A. 

Let M be a metric space with distance function d. We can then speak 
of open and closed balls. For instance B,(x) denotes the closed ball of 
radius a with center x. It consists of all points y with d(y,x)<a. An 
open subset V of M will be said to be scalloped if there exist open balls 
U, U,, ...,U,, in M such that 


V =UNTU,0--° NT. 


A covering {V;} of a subset W of M is said to be locally finite (with 
respect to W) if every point x € W has a neighborhood which meets only 
a finite number of elements of the covering. 


Lemma 3.5. Let M be a metric space and {U,} (i = 1, 2, ...) a countable 
covering of a subset W by open balls. Then there exists a locally finite 
open covering {V,} (i=1,2,...) of W such that V,< U, for all i, and 
such that V, is scalloped for all i. 
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Proof. We define V, inductively as follows. Each U, is a ball, say 
B,,(x;). Let Vj = U,. Having defined V;_,, let 


1 1 


Wye = Ay eee Ni-,i = Gi-1 TT 


i i 
and let 
V,=U;0 ‘B,(x4) CNPP EN B,,_, %i-1)> 


it being understood that a ball of negative radius is empty. Then each V, 
is scalloped, and is contained in U,. We contend that the V; cover W. 
Indeed, let x be an element of W. Let j be the smallest index such that 
xeU;. Then xe J, for otherwise, x would be in the complement of V; 
which is equal to the union of ‘U; and the balls 


B,, (x1) Ur B,., (%j-1): 


Hence x would lie in some U, with i <j, contradiction. 

There remains to be shown that our covering {V;,} is locally finite. Let 
xeW. Then x lies in some U,. Let s be such a small number > 0 
that the ball B,(x) is contained in U,. Let t =s/2. For all i sufficiently 
large, the ball B,(x) is contained in B, -1/;(x,) = B,,,(x,) and therefore this 
ball does not meet V,. We have found a neighborhood of x which meets 
only a finite number of members of our covering, which is consequently 
locally finite (with respect to W). 


Lemma 3.6. Let U be an open ball in Hilbert space E and let 
V=U0NTU,0°::0U, 


be a scalloped open subset. Then there exists a C”-function a: E>R 
such that w(x) > 0 if x € V and w(x) = 0 otherwise. 


Proof. For each U, let o;; E—R be a function such that 


O<o(x)<1 if xe, 
g(x)=1 if xeU,. 


Let ~(x) be a function such that g(x) >0 on U and (x) = 0 outside U. 
Let 


co(x) = (x) T] (1 — gi). 


Then w(x) satisfies our requirements. 
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Theorem 3.7. Let A,, A, be non-void, closed, disjoint subsets of a 
separable Hilbert space E. Then there exists a C®-function p:E—>R 
such that W(x) =0 if xe A, and W(x) =1 if xEA,, and OS W(x) S 1 
for all x. 


Proof. By Lindeléf’s theorem, we can find a countable collection of 
open balls {U,} (i= 1,2,...) covering A, and such that each U, is con- 
tained in the complement of A,. Let W be the union of the U;. We find 
a locally finite refinement {V;} as in Lemma 3.5. Using Lemma 3.6, we 
find a function @; which is > 0 on V; and 0 outside Y. Let ow = Ya; 
(the sum is finite at each point of W). Then w(x)>0 if xeA,, and 
o(x)=0if xe A,. 

Let U be the open neighborhood of A, on which @ is > 0. Then A, 
and ‘U are disjoint closed sets, and we can apply the above construction 
to obtain a function ¢: E>R which is > 0 on ‘UU and = 0 on A,. We 
let W = w/(o + w). Then w satisfies our requirements. 


Corollary 3.8. Let X be a paracompact manifold of class C’, modeled 
on a separable Hilbert space E. Then X admits partitions of unity (of 
class C?). 


Proof. It is trivially verified that an open ball of finite radius in E 
is C®’-isomorphic to E. (We reproduce the formula in Chapter VIL) 
Given any point xe X, and a neighborhood N of x, we can therefore 
always find a chart (G,y) at x such that yG=E, and GCN. Hence, 
given an open covering of X, we can find an atlas {(G,, y,)} subordinated 
to the given covering, such that y,G,=E. By paracompactness, we can 
find a refinement {U;} of the covering {G,} which is locally finite. Each 
U, is contained in some G,, and we let ; be the restriction of y,¢) to U;. 
We now find open refinements {V;} and then {W;} such that 


WcVc¥c, 


the bar denoting closure in X. Each V, being closed in X, it follows from 
our construction that 9,V, is closed in E, and so is g,W,. Using the 
theorem, and transporting functions on E to functions on X by means of 
the y,, we can find for each i a C?-function y,: X >R with is 1 on W 
and 0 on X— JV. We let y=)y;, and 0,=y,/p. Then the collection 


{6,} is the desired partition of unity. 
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Let E be a Banach space, and 4:E —R a continuous linear map into R. 
(This will also be called a functional on E.) We denote by E® the kernel 
of A, and by Ef (resp. Ej) the set of points xeE such that A(x) 20 
(resp. A(x) S$ 0). We call E? a hyperplane and Ej or Ej a half plane. 
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If uw is another functional and Ej =E;, then there exists a number 
c>O such that 4 =cp. This is easily proved. Indeed, we see at once 
that the kernels of 4 and uw must be equal. Suppose 4 #0. Let x9 be 
such that A(x9) > 0. Then p(xo) > 0 also. The functional 


A= (A(Xo)/H(Xo))H 


vanishes on the kernel of A (or ) and also on x9. Therefore it is the 0 
functional, and c = A(xo)/u(Xo). 

Let E, F be Banach spaces, and let Ej and F/ be two half planes in 
E and F respectively. Let U, V be two open subsets of these half planes 
respectively. We shall say that a mapping 


f:U-V 


is a morphism of class C? if the following condition is satisfied. Given a 
point x € U, there exists an open neighborhood U, of x in E, an open 
neighborhood V, of f(x) in F, and a morphism f,: U, > V, (in the sense 
of Chapter I) such that the restriction of f; to U, VU is equal to f. (We 
assume that all morphisms are of class C? with p = 1.) 

If our half planes are full planes (ie. equal to the vector spaces them- 
selves), then our present definition is the same as the one used previously. 

If we take as objects the open subsets of half planes in Banach spaces, 
and as morphisms the C?-morphisms, then we obtain a category. The 
notion of isomorphism is therefore defined, and the definition of manifold 
by means of atlases and charts can be used as before. The manifolds of 
§1 should have been called manifolds without boundary, reserving the 
name of manifold for our new globalized objects. However, in most of 
this book, we shall deal exclusively with manifolds without boundary 
for simplicity. The following remarks will give readers the means of 
extending any result they wish (provided it is true) for the case of mani- 
folds without boundaries to the case manifolds with. 

First, concerning the notion of derivative, we have: 


Proposition 4.1. Let f: UF and g: U >F be two morphisms of class 


C? (p = 1) defined on an open subset U of E. Assume that f and g have 
the same restriction to U 0 Ej} for some half plane Ej, and let 


xeUnkj. 
Then f’(x) = g'(x). 
Proof. After considering the difference of f and g, we may assume 


without loss of generality that the restriction of f to UNEj is 0. It is 
then obvious that f’(x) = 0. 
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Proposition 4.2. Let U be open in E. Let pw be a non-zero functional on 
F and let f: UF be a morphism of class C? with p21. If x is a 
point of U such that f(x) lies in F? then f'(x) maps E into F). 


Proof. Without loss of generality, we may assume that x =0 and 
f(x) =0. Let W be a given neighborhood of 0 in F. Suppose that we 
can find a small element ve E such that pf’(0)v #0. We can write (for 
small t): 


S(tv) = f'O)v + o(t)w, 


with some element w,e W. By assumption, f(tv) lies in F'. Applying pu 
we get 
tuf’(O)v + o(t)u(w,) 2 0. 


Dividing by ft, this yields 


ufo = Won, 


Replacing t by —t, we get a similar inequality on the other side. Letting 
t tend to 0 shows that pf’(0)v = 0, a contradiction. 


Let U be open in some half plane Ej. We define the boundary of U 
(written dU) to be the intersection of U with E°, and the interior of U 
(written Int(U)) to be the complement of dU in U. Then Int(U) is open 
in E. 

It follows at once from our definition of differentiability that a half 
plane is C®-isomorphic with a product 


Ej ~ E? x R* 


where R* is the set of real numbers = 0, whenever 1 #0. The boundary 
of Ej in that case is E? x 0. 


Proposition 4.3. Let 4 be a functional on E and y a functional on F. 
Let U be open in Ej and V open in F, and assume UNE?, VE) are 
not empty. Let f: UV be an isomorphism of class C? (p 2 1). Then 
2 #0 if and only if u #0. If 2 #0, then f induces a C?-isomorphism 
of Int(U) on Int(V) and of dU on OV. 


Proof. By the functoriality of the derivative, we know that f’(x) is a 
toplinear isomorphism for each xe U. Our first assertion follows from 
the preceding proposition. We also see that no interior point of U maps 
on a boundary point of V and conversely. Thus f induces a bijection of 
oU on OV and a bijection of Int(U) on Int(V). Since these interiors are 
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open in their respective spaces, our definition of derivative shows that f 
induces an isomorphism between them. As for the boundary, it is a 
submanifold of the full space, and locally, our definition of derivative, 
together with the product structure, shows that the restriction of f to dU 
must be an isomorphism on OV. 


This last proposition shows that the boundary is a differentiable in- 
variant, and thus that we can speak of the boundary of a manifold. 

We give just two words of warning concerning manifolds with bound- 
ary. First, products do not exist in their category. Indeed, to get 
products, we are forced to define manifolds with corners, which would 
take us too far afield. 

Second, in defining immersions or submanifolds, there is a difference in 
kind when we consider a manifold embedded in a manifold without 
boundary, or a manifold embedded in another manifold with boundary. 
Think of a closed interval embedded in an ordinary half plane. Two 
cases arise. The case where the interval lies inside the interior of the half 
plane is essentially distinct from the case where the interval has one end 
point touching the hyperplane forming the boundary of the half plane. 
(For instance, given two embeddings of the first type, there exists an 
automorphism of the half plane carrying one into the other, but there 
cannot exist an automorphism of the half plane carrying an embedding 
of the first type into one of the second type.) 

We leave it to the reader to go systematically through the notions of 
tangent space, immersion, embedding (and later, tangent bundle, vector 
field, etc.) for arbitrary manifolds (with boundary). For instance, Proposi- 
tion 2.2 shows at once how to get the tangent space functorially. 


CHAPTER _IIl 


Vector Bundles 


The collection of tangent spaces can be glued together to give a manifold 
with a natural projection, thus giving rise to the tangent bundle. The 
general glueing procedure can be used to construct more general objects 
known as vector bundles, which give powerful invariants of a given 
manifold. (For an interesting theorem see Mazur [Maz 61].) In this 
chapter, we develop purely formally certain functorial constructions hav- 
ing to do with vector bundles. In the chapters on differential forms and 
Riemannian metrics, we shall discuss in greater detail the constructions 
associated with multilinear alternating forms, and symmetric positive 
definite forms. 

Partitions of unity are an essential tool when considering vector bun- 
dles. They can be used to combine together a random collection of 
morphisms into vector bundles, and we shall give a few examples show- 
ing how this can be done (concerning exact sequences of bundles). 
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Let X be a manifold (of class C? with p20) and let 1:E—>X be a 
morphism. Let E be a Banach space. 
Let {U,} be an open covering of X, and for each i, suppose that we 
are given a mapping 
1:7 *(U,) + U, x E 


satisfying the following conditions: 
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VB 1. The map 1; is a C? isomorphism commuting with the projection 
on U,, that is, such that the following diagram is commutative: 


n(U,) —> U,xE 


yy 


In particular, we obtain an isomorphism on each fiber (written 
T(x) or Tix) 
Tix: M 1(x) 2 E 


VB 2. For each pair of open sets U;, U; the map 


-1. 
Tjx OT, 1 EE 


is a toplinear isomorphism. 


VB 3. If U; and U; are two members of the covering, then the map of 
U;, 0 U, into L(K, E) (actually Laut(E)) given by 


-1 
Xt (t;T; *)x 
is a morphism. 


Then we shall say that {(U,, t;)} is a trivializing covering for x (or for 
E by abuse of language), and that {r,} are its trivalizing maps. If x € U,, 
we say that 1, (or U,) trivializes at x. Two trivializing coverings for x are 
said to be VB-equivalent if taken together they also satisfy conditions 
VB 2, VB 3. An equivalence class of such trivializing coverings is said to 
determine a structure of vector bundle on x (or on E by abuse of lan- 
guage). We say that E is the total space of the bundle, and that X is its 
base space. If we wish to be very functorial, we shall write E, and X, for 
these spaces respectively. The fiber x~*(x) is also denoted by E, or z,. 
We also say that the vector bundle has fiber E, or is modeled on E. Note 
that from VB 2, the fiber x~'(x) above each point x eX can be given a 
structure of Banachable space, simply by transporting the Banach space 
structure of E to 2'(x) via t,,. Condition VB 2 insures that using 
two different trivializing maps 1;, or 1;, will give the same structure of 
Banachable space (with equivalent norms, of course not the same norms). 

Conversely, we could replace VB 2 by a similar condition as 
follows. 
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VB 2’. On each fiber n~'(x) we are given a structure of Banachable 
space, and for x € U,, the trivializing map 


Ti (x) = E,>E 
is a toplinear isomorphism. 


Then it follows that t,, 0 1),': EE is a toplinear isomorphism for each 
pair of open sets U;, U; and x € U,N Uj. 
In the finite dimensional case, condition VB 3 is implied by VB 2. 


Proposition 1.1. Let E, F be finite dimensional vector spaces. Let U be 
open in some Banach space. Let 


f:UxE-F 
be a morphism such that for each x € U, the map 
fi EOF 


given by f,(v) = f(x, v) is a linear map. Then the map of U into L(E, F) 
given by xt+f,, is a morphism. 


Proof. We can write F = R, x --: x R, (n copies of R). Using the fact 
that L(E, F) = L(E, R,) x --- x L(E,R,,), it will suffice to prove our asser- 
tion when F = R. Similarly, we can assume that E = R also. But in that 
case, the function f(x, v) can be written g(x)v for some map g:U OR. 
Since f is a morphism, it follows that as a function of each argument x, 
v it is also a morphism. Putting v = 1 shows that g is a morphism and 
concludes the proof. 


Returning to the general definition of a vector bundle, we call the 
maps 
Tix = Tx © Tig 
the transition maps associated with the covering. They satisfy what we 
call the cocycle condition 


Tkjx © Tix = Tix: 


; : he 
In particular, t;,, = id and 1), = tz. 
As with manifolds, we can recover a vector bundle from a trivializing 


covering. 
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Proposition 1.2. Let X be a manifold, and xn: E- X a mapping from 
some set E into X. Let {U;} be an open covering of X, and for each i 
suppose that we are given a Banach space E and a bijection (commuting 
with the projection on U;), 


tq, 1(U;) > U; x E, 


such that for each pair i, j and x€U,QU,, the map (t,t; 1) is a 
toplinear isomorphism, and condition VB 3 is satisfied as well as the 
cocycle condition. Then there exists a unique structure of manifold on E 
such that x is a morphism, such that 1; is an isomorphism making 1 into 
a vector bundle, and {(U;, t;)} into a trivialising covering. 


Proof. By Proposition 3.10 of Chapter I and our condition VB 3, we 
conclude that the map 


1)7; 1: (U;0U,) x E>(U;NU,) x E 


is a morphism, and in fact an isomorphism since it has an inverse. From 
the definition of atlases, we conclude that E has a unique manifold 
structure such that the t; are isomorphisms. Since x is obtained locally 
as a composite of morphisms (namely 1; and the projections of U; x E on 
the first factor), it becomes a morphism. On each fiber x(x), we can 
transport the topological vector space structure of any E such that x lies 
in U;, by means of 1;,. The result is independent of the choice of U; since 
(t;t;'), is a toplinear isomorphism. Our proposition is proved. 


Remark. It is relatively rare that a vector bundle is trivial, i. VB- 
isomorphic to a product X x E. By definition, it is always trivial locally. 
In the finite dimensional case, say when E has dimension n, a trivializa- 
tion is equivalent to the existence of sections €,, ...,€, such that for each 
x, the vectors &,(x), ...,€,(x) form a basis of E,. Such a choice of 
sections is called a frame of the bundle, and is used especially with the 
tangent bundle, to be defined below. In this book where we give proofs 
valid in the infinite dimensional case, frames will therefore not occur until 
we get to strictly finite dimensional phenomenon. 


The local representation of a vector bundle and 
the vector component of a morphism 


For arbitrary vector bundles (and especially the tangent bundle to 
be defined below), we have a local representation of the bundle as a 
product in a chart. For many purposes, and especially the case of a 
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morphism 
f:YoE 


of a manifold into the vector bundle, it is more convenient to use U to 
denote an open subset of a Banach space, and to let g: UX be an 
isomorphism of U with an open subset of X over which E has a triviali- 
zation t: 2 1(gU) > U x E. Suppose V is an open subset of Y such that 
f(V) < x 4(@U). We then have the commutative diagram: 


V2 AGU = URE 


—— 


gU —> U 


The composite tof is a morphism of V into U x E, which has two 
components 


tof =(fui, fu2) 


such that fy,: V-> U and fy,: V>E. We call fy. the vector component of 
f in the vector bundle chart U x E over U. Sometimes to simplify the 
notation, we omit the subscript, and merely agree that f, = fy, denotes 
this vector component; or to simplify the notation further, we may sim- 
ply state that f itself denotes this vector component if a discussion takes 
place entirely in a chart. In this case, we say that f = fy represents the 
morphism in the vector bundle chart, or in the chart. 


Vector bundle morphisms and pull backs 
We now make the set of vector bundles into a category. 
Let 2: EX and 7’: E'> X’ be two vector bundles. A VB-morphism 
m— x’ consists of a pair of morphisms 
fo: X > X' and f:EoE 


satisfying the following conditions. 


VB Mor 1. The diagram 
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is commutative, and the induced map for each x € X 
fe: Ey > Eqay 
is a continuous linear map. 
VB Mor 2. For each x, ¢X there exist trivializing maps 


t:m \(U)>UXxE 
and 
tin 4(U') 3 U’ x EF’ 


at X9 and f(x) respectively, such that f)(U) is contained in 
U’, and such that the map of U into L(E, E’) given by 


, -1 
XT Of OT 
is a morphism. 


As a matter of notation, we shall also use f to denote the VB- 
morphism, and thus write f:— 7’. In most applications, f) is the iden- 
tity. By Proposition 1.1, we observe that VB Mor 2 is redundant in the 
finite dimensional case. 

The next proposition is the analogue of Proposition 1.2 for VB- 
morphisms. 


Proposition 1.3. Let 2, x’ be two vector bundles over manifolds X, X' 
respectively. Let f: X > X’ be a morphism, and suppose that we are 
given for each x € X a continuous linear map 


Se! Me > Moxy 


such that, for each x9, condition VB Mor 2 is satisfied. Then the map 
f from x to x’ defined by f, on each fiber is a VB-morphism. 


Proof. One must first check that f is a morphism. This can be done 
under the assumption that 2, x’ are trivial, say equal to U x E and 
U' x E’ (following the notation of VB Mor 2), with trivialising maps 
equal to the identity. Our map f is then given by 


(x, v) ie. (fox, f.0). 


Using Proposition 3.10 of Chapter I, we conclude that f is a morphism, 
and hence that (fo, f) is a VB-morphism. 
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It is clear how to compose two VB-morphisms set theoretically. In 
fact, the composite of two VB-morphisms is a VB-morphism. There is 
no problem verifying condition VB Mor 1, and for VB Mor 2, we look at 
the situation locally. We encounter a commutative diagram of the fol- 
lowing type: 

f g 


m(U) > atu) 2s aU") 


, ” 


tT tT Tt 


Ux E —— U’x EF’ —  U’ xk’ 


and use Proposition 3.10 of Chapter I, to show that gof is a VB- 
morphism. 

We therefore have a category, denoted by VB or VB?, if we need to 
specify explicitly the order of differentiability. 

The vector bundles over X from a subcategory VB(X) = VB?(X) (tak- 
ing those VB-morphisms for which the map fo is the identity). If WW is a 
category of Banach spaces (for instance finite dimensional spaces), then 
we denote by VB(X, 2) those vector bundles over X whose fibers lie in 
W. 

A morphism from one vector bundle into another can be given locally. 
More precisely, suppose that U is an open subset of X and 2:E>X a 
vector bundle over X. Let Ey = x~'(U) and 


Ty = 1\|Ey 


be the restriction of x to Ey. Then zy is a vector bundle over U. Let 
{U,} be an open covering of the manifold X and let z, 2’ be two vector 
bundles over X. Suppose, given a VB-morphism 


Si: hy, > Ty, 


for each i, such that f; and f; agree over U; 7 U; for each pair of indices i, 
j. Then there exists a unique VB-morphism f:2— 2' which agrees with 
f; on each U;. The proof is trivial, but the remark will be used frequently 
in the sequel. 

Using the discussion at the end of Chapter II, §2 and Proposition 2.7 
of that chapter, we get immediately: 


Proposition 1.4. Let 2:E—Y be a vector bundle, and f:X—>Y a 
morphism. Then 


F*(): f*(E) > X 
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is a vector bundle, and the pair (f, x*(f)) is a VB-morphism 


m(f) 


IME) > E 
ro | 
x — Y 

f 


In Proposition 1.4, we could take f to be the inclusion of a submani- 
fold. In that case, the pull-back is merely the restriction. As with open 
sets, we can then use the usual notation: 


Exy=n'(X) and = ny=R2\Ey. 


Thus zy = f*(z) in that case. 
If X happens to be a point y of Y, then we have the constant map 


ty: Ey y 


which will sometimes be identified with E,,. 

If we identify each fiber (f*E), with E,,,, itself (a harmless identifica- 
tion since an element of the fiber at x is simply a pair (x, e) with e in 
Ey x), then we can describe the pull-back f* of a vector bundle 1: E> Y 
as follows. It is a vector bundle f*z: f*E > X satisfying the following 
properties: 


PB 1. For each x € X, we have (f*E), = Ep. 


PB 2. We have a commutative diagram 


JE) — > E 
ro | 
xX — Y 

f 


the top horizontal map being the identity on each fiber. 


PB 3. If E is trivial, equal to Y x E, then f*E = X x E and f*n is the 
projection. 


PB 4. If V is an open subset of Y and U = f~1(V), then 


f*(Ey) = (f*E)y, 
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and we have a commutative diagram: 


Ill, §2. THE TANGENT BUNDLE 


Let X be a manifold of class C? with p= 1. We shall define a functor T 
from the category of such manifolds into the category of vector bundles 
of class C?“}. 

For each manifold X we let T(X) be the disjoint union of the tangent 
spaces T,(X). We have a natural projection 


nm: T(X) > X 
mapping T,(X) on x. We must make this into a vector bundle. If (U, ¢) 
is a chart of X such that @U is open in the Banach space EK, then from 
the definition of the tangent vectors as equivalence classes of triples 
(U, ~, v) we get immediately a bijection 
ty:n (UU) = TU) > UKE 


which commutes with the projection on U, that is such that 


nm (U) —4 UXxE 


U 
is commutative. Furthermore, if (U;, g;) and (U;, @;) are two charts, and 
if we denote by ¢,; the map 9,9; ' (defined on g(U,>U,)), then we 
obtain a transition mapping 


Ti = (t,t; *): e(U,0 U;) xX E> @(U,9 U;) xE 


by the formula 
Tji(X, v) = (QjiX, Do;(x) *v) 


for x€U,U, and ve E. Since the derivative De, = oj, is of class C?~* 
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and is an isomorphism at x, we see immediately that all the conditions of 
Proposition 1.2 are verified (using Proposition 3.10 of Chapter I), thereby 
making T(X) into a vector bundle of class C?~?. 

We see that the above construction can also be expressed as follows. 
If the manifold X is glued together from open sets {U,} in Banach 
spaces by means of transition mappings {g,;}, then we can glue together 
products U; x E by means of transition mappings (gj, Dg) where the 
derivative Dg can be viewed as a function of two variables (x, v). Thus 
locally, for open subsets U of Banach spaces, the tangent bundle can be 
identified with the product U x E. The reader will note that our defini- 
tion coincides with the oldest definition employed by geometers, our 
tangent vectors being vectors which transform according to a certain rule 
(namely the derivative). 

If f: X > X’ is a C’-morphism, we can define 


Tf: T(X) > T(X’) 


to be simply T,f on each fiber T,(X). In order to verify that Tf is a 
VB-morphism (of class C?~'), it suffices to look at the situation locally, 
ie. we may assume that X and X’ are open in vector spaces E, E’, and 
that T, f = f’(x) is simply the derivative. Then the map Tf is given by 


Tf(x, v) = (09), fv) 


for xe X and ve E. Since f’ is of class C?* by definition, we can apply 
Proposition 3.10 of Chapter I to conclude that Tf is also of class C?"!. 
The functoriality property is trivially satisfied, and we have therefore 
defined the functor T as promised. 

It will sometimes be notationally convenient to write f, instead of Tf 
for the induced map, which is also called the tangent map. The bundle 
T(X) is called the tangent bundle of X. 


Remark. The above definition of the tangent bundle fits with 
Steenrod’s point of view [Ste 51]. I don’t understand why many differen- 
tial geometers have systematically rejected this point of view, when they 
take the definition of a tangent vector as a differential operator. 


lll, §3. EXACT SEQUENCES OF BUNDLES 
Let X be a manifold. Let 2’: EF’ > X and x: E—X be two vector bun- 
dles over X. Let f:2'—>2 be a VB-morphism. We shall say that the 


sequence 


Ga oe 
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is exact if there exists a covering of X by open sets and for each open set 
U in this covering there exist trivializations 


t:EyoU xk and tiEy>UxE 


such that E can be written as a product E = E’ x F, making the follow- 
ing diagram commutative: 
f 


By > Ey 


Ux EF — UxExF 


(The bottom map is the natural one: Identity on U and the injection of 
FE’ on FE’ x 0.) 

Let 2,: E, > X be another vector bundle, and let g:z,->72 be a VB- 
morphism such that g(E,) is contained in f(E’). Since f establishes a 
bijection between E’ and its image f(E’) in E, it follows that there exists 
a unique map g,: E, > E’ such that g = fog,. We contend that g, is a 
VB-morphism. Indeed, to prove this we can work locally, and in view of 
the definition, over an open set U as above, we can write 


gp =t loprotog 


where pr is the projection of U x E’ x F on U x E’. All the maps on 
the right-hand side of our equality are VB-morphisms; this proves our 
contention. 

Let 2: EX be a vector bundle. A subset S of E will be called a 
subbundle if there exists an exact sequence 0 > 7’ > 7, also written 


o-b SE, 


such that f(E’)=S. This gives S the structure of a vector bundle, and 
the previous remarks show that it is unique. In fact, given another exact 
sequence 

0+E£,-SE 


such that g(E,)=S, the natural map f 'g from E, to E’ is a 
VB-isomorphism. 

Let us denote by E/E’ the union of all factor spaces E,/E‘,. If we are 
dealing with an exact sequence as above, then we can give E/E’ the 
structure of a vector bundle. We proceed as follows. Let {U;} be our 
covering, with trivialising maps t; and t;. We can define for each i a 
bijection 

nj: Ey,/Ey, > U; x F 


LIII, §3] EXACT SEQUENCES OF BUNDLES 51 


obtained in a natural way from the above commutative diagram. (With- 
out loss of generality, we can assume that the vector spaces E’, F are 
constant for all i) We have to prove that these bijections satisfy the 
conditions of Proposition 1.2. 

Without loss of generality, we may assume that f is an inclusion 
(of the total space E’ into E). For each pair i, j and xe U,U,, the 
toplinear automorphism (t,t; *), is represented by a matrix 


(ee i 

ha,(x)  hz2(x) 

operating on the right on a vector (v, w)¢ EK’ x F. The map (t/7/~*), on 
F is induced by this matrix. Since E’ = E’ x 0 has to be carried into 
itself by the matrix, we have h,,(x)=0. Furthermore, since (t;t;*), has 
an inverse, equal to (t,t; *),, it follows that h,(x) is a toplinear auto- 
morphism of F, and represents (tjt/~'),. Therefore condition VB 3 is 


satisfied, and E/E’ is a vector bundle. 
The canonical map 


Ey > Ey/Ey 


is a morphism since it can be expressed in terms of t, the projection, and 


t’-1. Consequently, we obtain a VB-morphism 


g:n>n" 


in the canonical way (on the total spaces, it is the quotient mapping of E 
on E/E’). We shall call x” the factor bundle. 

Our map g satisfies the usual universal mapping property of a 
cokernel. Indeed, suppose that 


Ww: E>G 


is a VB-morphism such that yo f=0 (ie. Wo f, =0 on each fiber E’,). 
We can then define set theoretically a canonical map 


W,: E/E’ > G, 


and we must prove that it is a VB-morphism. This can be done locally. 

Using the above notation, we may assume that E=U x E’ x F and 

that g is the projection. In that case, wy, is simply the canonical injection 

of U x F in U x E’ x F followed by y, and is therefore a VB-morphism. 
We shall therefore call g the cokernel of f. 
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Dually, let g: n> x” be a given VB-morphism. We shall say that the 
sequence 


n—+> 1" +0 
is exact if g is surjective, and if there exists a covering of X by open sets, 


and for each open set U in this covering there exist spaces E’, F and 
trivializations 


tiEyoUxE xF and tv’: Ey > F 


making the following diagram commutative: 


UxE x F — UxF 


(The bottom map is the natural one: Identity on U and the projection of 
E’ x F on F,) 

In the same way as before, one sees that the “kernel” of g, that is, the 
union of the kernels E’, of each g,, can be given a structure of vector 
bundle. This union E’ will be called the kernel of g, and satisfies the 
usual universal mapping property. 


Proposition 3.1. Let X be a manifold and let 
fit on 


be a VB-morphism of vector bundles over X. Assume that, for each 
x € X, the continuous linear map 


f: E> Ey 
is injective and splits. Then the sequence 


037’ at, 1 
is exact. 


Proof. We can assume that X is connected and that the fibers of E’ 
and E are constant, say equal to the Banach spaces E’ and E. Let ae X. 
Corresponding to the splitting of f, we know that we have a product 
decomposition E = E’ x F and that there exists an open set U of X 
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containing a, together with trivialising maps 
tat(U)>e>UxE and tin (U-UxE 
such that the composite map 
tj] —1 Sa 


Ei > Be 5 FO x F 


maps E’ on E’ x 0. 
For any point x in U, we have a map 


(tft’*),: E’ +E’ x F, 
which can be represented by a pair of continuous linear maps 
(hy 1 (x), hy, (x). 
We define 


h(x): E’ x FE’ x F 


te 
h(x) id)’ 


by the matrix 


53 


operating on the right on a vector (v, w)€ E’ x F. Then h(x) restricted to 


E’ x 0 has the same action as (tft’*),. 


The map x—h(x) is a morphism of U into L(E, E) and since it is 
continuous, it follows that for U small enough around our fixed point a, 
it maps U into the group of toplinear automorphisms of E. This proves 


our proposition. 
Dually to Proposition 3.1, we have: 
Proposition 3.2. Let X be a manifold and let 


g: nan" 


be a VB-morphism of vector bundles over X. Assume that for each 


x € X, the continuous linear map 
gx: E,, > EY 
is surjective and has a kernel that splits. Then the sequence 


n—>x" +0 


is exact. 
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Proof. It is dual to the preceding one and we leave it to the reader. 
In general, a sequence of VB-morphisms 
OSH nea 


is said to be exact if both ends are exact, and if the image of f is equal 
to the kernel of g. 

There is an important example of exact sequence. Let f: X > Y be an 
immersion. By the universal mapping property of pull backs, we have a 
canonical VB-morphism 


T*f: T(X) > f*T(Y) 


of T(X) into the pull back over X of the tangent bundle of Y. Further- 
more, from the manner in which the pull back is obtained locally by 
taking products, and the definition of an immersion, one sees that the 
sequence 


oe 
0 > T(X) > f*T(Y) 
is exact. The factor bundle 
f*T(Y)/Im(T*f) 


is called the normal bundle of f. It is denoted by N(f), and its total 
space by N,(X) if we wish to distinguish between the two. We sometimes 
identify T(X) with its image under T*f and write 


N(f) = f*T(Y)/T(X). 
Dually, let f: X — Y be a submersion. Then we have an exact sequence 
T(X) 5 f*T(Y) +0 
a kernel could be called the subbundle of f, or the bundle along the 
er. 


There is an interesting case where we can describe the kernel more 
precisely. Let 


at: E>X 


be a vector bundle. Then we can form the pull back of E over itself, that 
is, x*E, and we contend that we have an exact sequence 


0 > n*E > T(E) > n*T(X) > 0. 
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To define the map on the left, we look at the subbundle of x more 
closely. For each x € X we have an inclusion 


E,- E, 


x 


whence a natural injection 
T(E,.) > T(E). 


The local product structure of a bundle shows that the union of the 
T(E,,) as x ranges over X gives the subbundle set theoretically. On the 
other hand, the total space of z*E consists of pairs of vectors (v, w) lying 
over the same base point x, that is, the fiber at x of x*E is simply 
E, x E,. Since T(E,) has a natural identification with E, x E,, we get 
for each x a bijection 


(x*E), > T(E) 
which defines our map from 2*E to T(E). Considering the map locally 
in terms of the local product structure shows at once that it gives a 
VB-isomorphism between x*E and the subbundle of z, as desired. 
Ill, §4. OPERATIONS ON VECTOR BUNDLES 
We consider subcategories of Banach spaces QM, 8, € and let 

AU x BoC 
be a functor in, say, two variables, which is, say, contravariant in the first 
and covariant in the second. (Everything we shall do extends in the 
obvious manner to functors of several variables, letting 2, 8 stand for 
n-tuples.) 

Example. We took a functor in two variables for definiteness, and to 

illustrate both variances. However, we could consider a functor in one or 
more than two variables. For instance, let us consider the functor 


E+ L(E, R) = L(E) = E’, 


which we call the dual. It is a contravariant functor in one variable. On 
the other hand, the functor 


E+ L‘(E, F) 


of continuous multilinear maps of E x --: x E into a Banach space F is 
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contravariant in E and covariant in F. The functor E+» L"(E, R) gives 
rise later to what we call differential forms. We shall treat such forms 
systematically in Chapter V, §3. 

If f: E’>E and g:F—F' are two continuous linear maps, with f a 
morphism of Q and g a morphism of 8, then by definition, we have a 
map 


L(E’, E) x L(F, F’) > L(A(E, F), (E, F’)), 


assigning A(f, g) to (f, g). 
We shall say that 4 is of class C” if the following condition is satisfied. 
Give a manifold U, and two morphisms 


g: U > L(E, E) and Ww: U > L(F, F’), 
then the composite 
U > L(E’,, E) x LiF, F) > LUE, F), A(E’, F’)) 
is also a morphism. (One could also say that A is differentiable.) 


Theorem 4.1. Let 4 be a functor as above, of class C?, p=0. Then for 
each manifold X, there exists a functor Ay, on vector bundles (of class 
C?) 

Ax: VB(X, WW) x VB(X, B) > VB(X, ©) 


satisfying the following properties. For any bundles a, B in VB(X, QW) 
and VB(X, 8) respectively, and VB-morphisms 


fia>a and g: Bop 


in the respective categories, and for each x € X, we have: 
OP 1. Ax(%, B)x ad AG, B). 


OP 2. Ax(f, Ix = Mies 9x): 


OP 3. If « is the trivial bundle X x E and 8 the trivial bundle X x F, 
then A,y(«, B) is the trivial bundle X x A(E, F). 


OP 4. If h: YX is a C”-morphism, then 
Ay(h*a, h*B) = h*Ax(a, B). 


Proof. We may assume that X is connected, so that all the fibers 
are toplinearly isomorphic to a fixed space. For each open subset U 
of X we let the total space Ay(E,, Eg) of Ay(a, B) be the union of the 
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sets 

{x} x A(a., Be) 
(identified harmlessly throughout with A(a,, B,)), as x ranges over U. We 
can find a covering {U,} of X with trivializing maps {t,;} for a, and {o;} 


for B, 
Te a~*(U;) oa U; x E, 


o;: B-*(U;) > U, x F. 
We have a bijection 
A(t;", 03): Ay,(Exs Eg) > U; x A(E, F) 
obtained by taking on each fiber the map 
Mctix's Ojx): Mo 5 B) 3 A(E, F). 
We must verify that VB 3 is satisfied. This means looking at the map 
x—> Mtjxs Gjx) ° A(t’ Oy) * 
The expression on the right is equal to 
MCixTix's Ox ix: ). 
Since 4 is a functor of class C”, we see that we get a map 
U,0 U; > L(A(E, F), A(E, F)) 
which is a C?-morphism. Furthermore, since A is a functor, the transition 
mappings are in fact toplinear isomorphisms, and VB2, VB 3 are proved. 
The proof of the analogous statement for A,(f, g), to the effect that it 
is a VB-morphism, proceeds in an analogous way, again using the hy- 
pothesis that A is of class C?. Condition OP 3 is obviously satisfied, and 
OP 4 follows by localizing. This proves our theorem. 
The next theorem gives us the uniqueness of the operation /,. 
Theorem 4.2. If u is another functor of class C? with the same variance 


as 4, and if we have a natural transformation of functors t: 2 — py, then 
for each X, the mapping 


ty: Ay > by, 


58 VECTOR BUNDLES [IIT, §4] 


defined on each fiber by the map 
t(a.5 B.): Max» Bx) > U(x» Bx), 
is a natural transformation of functors (in the VB-category). 
Proof. For simplicity of notation, assume that 4 and yw are both 


functors of one variable, and both covariant. For each open set U = U, 
of a trivializing covering for B, we have a commutative diagram: 


vega SS Ge 
Ay(o) Hy(o) 
Ay(B) ——————> by (B) 
ty 


The vertical maps are trivializing VB-isomorphisms, and the top horizon- 
tal map is a VB-morphism. Hence ty is a VB-morphism, and our asser- 
tion is proved. 


In particular, for 4=y and t=id we get the uniqueness of our 
functor Ay. 

(In the proof of Theorem 4.2, we do not use again explicitly the hypo- 
theses that J, » are differentiable.) 

In practice, we omit the subscript X on A, and write 4 for the functor 
on vector bundles. 


Examples. Let 2: E— X be a vector bundle. We take 4 to be the 
dual, that is EY EY = L(E,R). Then J(E£) is denoted by EY, and is 
called the dual bundle. The fiber at each point x eX is the dual space 
Ex. The dual bundle of the tangent bundle is called the cotangent bundle 
TYX. 

Similarly, instead of taking L(E), we could take L7(E) to be the bundle 
of alternating multilinear forms on E. The fiber at each point is the 
space L'(E,) consisting of all r-multilinear alternating continuous func- 
tions on E,. When E=TX is the tangent bundle, the sections of 
L’‘(TX) are called differential forms of degree r. Thus a 1-form is a 
section of EY’. Differential forms will be treated later in detail. 

Recall that End(E) = L(E, E). In the theory of curvature, we shall deal 
with both functors 


ES L‘(E)=L*4(E,R)and_~—«E+ L(E, End(E)) = L(E, L(E, B)). 


In fact, if R € L?(E, L(E, E)), then for each pair of elements v, we E and 
ze, we see that R(v, w) e L(E, E) and R(v, w)z € E, so we get a 3-linear 
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map 
(v, w, z)}> R(v, w)z. 


We shall apply both functors to the tangent bundle in Chapter IX. 


For another type of operation, we have the direct sum (also called the 
Whitney sum) of two bundles «, B over X. It is denoted by «@ f, and 
the fiber at a point x is 


(4 @ B),. = % ® B- 


Of course, the finite direct sum of vector spaces can be identified with 
their finite direct products, but we write the above operation as a direct 
sum in order not to confuse it with the following direct product. 

Let a: E,>X and f:E,—Y be two vector bundles in VB(X) and 
VB(Y) respectively. Then the map 


ax BP: E,x Es >xX x Y 


is a vector bundle, and it is this operation which we call the direct 
product of « and P. 

Let X be a manifold, and A a functor of class C? with p21. The 
tensor bundle of type 4 over X is defined to be A,(T(X)), also denoted by 
AT(X) or T,(X). The sections of this bundle are called tensor fields of 
type 4, and the set of such sections is denoted by I,(X). Suppose that 
we have a trivialization of T(X), say 


T(X)=X xE. 


Then T,(X) = X x A(E). A section of T,(X) in this representation is com- 
pletely described by the projection on the second factor, which is a 
morphism 

f:X aA). 


We shall call it the local representation of the tensor field (in the given 
trivialization). If € is the tensor field having f as its local representation, 
then 


E(x) = (x, fQ)). 


Let f: X > Y be a morphism of class C? (p21). Let w be a tensor 
field of type L’ over Y, which could also be called a multilinear tensor 
field. For each ye Y, w(y) (also written w,) is a continuous multilinear 
function on T,(Y): 
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For each x € X, we can define a continuous multilinear map 
*(o): T, xX: xX T.7R 
by the composition of maps (T,f)' and @,,.): 
T.x-x Tho Fes eK Tree 2 R. 


We contend that the map x+> f*(w) is a tensor field over X, of the same 
type as w. To prove this, we may work with local representation. Thus 
we can-assume that we work with a morphism 


f:U>~V 
of one open set in a Banach space into another, and that 
ao:V—> LF) 


is a morphism, V being open in F. If U is open in E, then f*() (now 
denoting a local representation) becomes a mapping of U into L’(E), 
given by the formula 


So) = L'(f'(x)) @( fo). 


Since L’: L(E, F) > L(L’(F), L'(E)) is of class C®, it follows that f*(@) is 
a morphism of the same class as w. This proves what we want. 

Of course, the same argument is valid for the other functors L{ and Lj 
(symmetric and alternating continuous multilinear maps). Special cases 
will be considered in later chapters. If 4 denotes any one of our three 
functors, then we see that we have obtained a mapping (which is in fact 
linear) 

f* TY) > TX) 


which is clearly functorial in f. We use the notation f* instead of the 
more correct (but clumsy) notation f, or T,(f). No confusion will arise 
from this. 


lll, §5. SPLITTING OF VECTOR BUNDLES 


The next proposition expresses the fact that the VB-morphisms of one 
bundle into another (over a fixed morphism) form a module over the 
ring of functions. 
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Proposition 5.1. Let X, Y be manifolds and fy: X > Y a morphism. Let 
a, B be vector bundles over X, Y respectively, and let f, g:«— B be two 
VB-morphisms over fy. Then the map f + g defined by the formula 


(f+ ds = fe + 9x 


is also a VB-morphism. Furthermore, if w:Y—R is a function on Y, 
then the map wWf defined by 


Wf). = Wfol)) fe 
is also a VB-morphism. 


Proof. Both assertions are immediate consequences of Proposition 
3.10 of Chapter I. 


We shall consider mostly the situation where X = Y and fo is the 
identity, and will use it, together with partitions of unity, to glue VB- 
morphisms together. 

Let «, B be vector bundles over X and let {(U;, W,)} be a partition of 
unity on X. Suppose given for each U; a VB-morphism 


Ff: aU; > B|U;. 

Each one of the maps y;f; (defined as in Proposition 5.1) is a VB- 
morphism. Furthermore, we can extend y;f; to a VB-morphism of « into 
B simply by putting 

(ifs = 0 
for all x ¢ U;. If we now define 

fia Bp 

f,(o) = YW fix 0) 


by the formula 


for all pairs (x,v) with vea,, then the sum is actually finite, at each 
point x, and again by Proposition 5.1, we see that f is a VB-morphism. 
We observe that if each f; is the identity, then f =) yf, is also the 
identity. 


Proposition 5.2. Let X be a manifold admitting partitions of unity. Let 
0>a> 8 be an exact sequence of vector bundles over X. Then there 
exists a surjective VB-morphism g: B >a whose kernel splits at each 
point, such that g o f = id. 
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Proof. By the definition of exact sequence, there exists a partition of 
unity {(U;, w,)} on X such that for each i, we can split the sequence over 
U;. In other words, there exists for each i a VB-morphism 


gi: B|U; > &|U; 


which is surjective, whose kernel splits, and such that g;o f; = id;. We let 
g => Wg; Then g is a VB-morphism of 8 into « by what we have just 
seen, and 


g°f => Wigif, = id. 


It is trivial that g is surjective because go f = id. The kernel of g, splits 
at each point x because it has a closed complement, namely f.«,. This 
concludes the proof. 


If y is the kernel of B, then we have B + a @ y. 

A vector bundle x over X will be said to be of finite type if there 
exists a finite trivialization for x (i.e. a trivialization {(U;,1,)} such that i 
ranges over a finite set). 

If k is an integer = 1 and E a topological vector space, then we 
denote by E* the direct product of E with itself k times. 


Proposition 5.3. Let X be a manifold admitting partitions of unity. Let 
n be a vector bundle of finite type in VB(X,E), where E is a Banach 
space. Then there exists an integer k>0 and a vector bundle « in 
VB(X, E*) such that x @ « is trivializable. 


Proof. We shall prove that there exists an exact sequence 
0-7 a B 

with E,=XxE* Our theorem will follow from the preceding 
proposition. 

Let {(U;,7,)} be a finite trivialization of 2 with i=1, ...,k. Let 
{(U;, W;)} be a partition of unity. We define 

f:E, 7X x EF 
as follows. If x ¢ X and v is in the fiber of E, at x, then 
FW) = (x, Vi)t1 0), --- Wel) te(0)). 

The expression on the right makes sense, because in case x does not lie 


in U, then (x) =0 and we do not have to worry about the expression 
t,(v). If x lies in U;, then t,(v) means 7;,(v). 
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Given any point x, there exists some index i such that (x) >0 and 
hence f is injective. Furthermore, for this x and this index i, f, maps E, 
onto a closed subspace of E*, which admits a closed complement, namely 

Ex: x0x:::xE 


with 0 in the i-th place. This proves our proposition. 


CHAPTER _ IV 


Vector Fields and Differential 
Equations 


In this chapter, we collect a number of results all of which make use of 
the notion of differential equation and solutions of differential equations. 

Let X be a manifold. A vector field on X assigns to each point x of 
X a tangent vector, differentiably. (For the precise definition, see §2.) 
Given X, in X, it is then possible to construct a unique curve a(t) 
starting at XQ, (i.e. such that «(0) = x9) whose derivative at each point is 
the given vector. It is not always possible to make the curve depend on 
time t from —oo to +00, although it is possible if X is compact. 

The structure of these curves presents a fruitful domain of investiga- 
tion, from a number of points of view. For instance, one may ask for 
topological properties of the curves, that is those which are invariant 
under topological automorphisms of the manifold. (Is the curve a closed 
curve, is it a spiral, is it dense, etc.?) More generally, following standard 
procedures, one may ask for properties which are invariant under any 
given interesting group of automorphisms of X (discrete groups, Lie 
groups, algebraic groups, Riemannian automorphisms, ad lib.). 

We do not go into these theories, each of which proceeds according to 
its own flavor. We give merely the elementary facts and definitions 
associated with vector fields, and some simple applications of the exis- 
tence theorem for their curves. 

Throughout this chapter, we assume all manifolds to be Hausdorff, of 
class C” with p = 2 from §2 on, and p = 3 from §3 on. This latter condi- 
tion insures that the tangent bundle is of class C?~! with p — 1 = 1 (or 2). 

We shall deal with mappings of several variables, say f(t, x, y), the first 
of which will be a real variable. We identify D, f(t, x, y) with 


lim S(t aa h, x, y) — f(t, x, y) 


h>0 h 
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IV, §1. EXISTENCE THEOREM FOR 
DIFFERENTIAL EQUATIONS 


Let E be a Banach space and U an open subset of E. In this section we 
consider vector fields locally. The notion will be globalized later, and 
thus for the moment, we define (the local representation of) a time- 
dependent vector field on U to be a C?-morphism (p = 0) 


f:J x UE, 


where J is an open interval containing 0 in R. We think of f as 
assigning to each point x in U a vector f(t, x) in E, depending on time t. 

Let x9 be a point of U. An integral curve for f with initial condition 
Xq is a mapping of class C’ (r = 1) 


a:Jg > U 


of an open subinterval of J containing 0, into U, such that «(0) = xo and 
such that 


a'(t) = f(t, a(t)). 


Remark. Let «: Jo > U be a continuous map satisfying the condition 


a(t) =X9 + ic a(u)) du. 
0 


Then « is differentiable, and its derivative is f(t, «(t)). Hence « is of class 
C'. Furthermore, we can argue recursively, and conclude that if f is of 
class C?, then so is a. Conversely, if « is an integral curve for f with 
initial condition x9, then it obviously satisfies our integral relation. 


Let 
f:JxU-E 


be as above, and let x9 be a point of U. By a local flow for f at x we 
mean a mapping 


a:Jg x UU 


where J, is an open subinterval of J containing 0, and Up is an open 
subset of U containing x), such that for each x in Uy) the map 


a,(t) = a(t, x) 


is an integral curve for f with initial condition x (i.e. such that «(0, x) = 
x). 
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As a matter of notation, when we have a mapping with two argu- 
ments, say (t,x), then we denote the separate mappings in each argu- 
ment when the other is kept fixed by ,(t) and @,(x). The choice of 
letters will always prevent ambiguity. 

We shall say that f satisfies a Lipschitz condition on U uniformly with 
respect to J if there exists a number K > 0 such that 


If, x) -— f@ yl S K|x — yl 


for all x, y in U and t in J. We call K a Lipschitz constant. If f is of 
class C1, it follows at once from the mean value theorem that f is 
Lipschitz on some open neighborhood J) x Up of a given point (0, x9) of 
U, and that it is bounded on some such neighborhood. 

We shall now prove that under a Lipschitz condition, local flows exist 
and are unique locally. In fact, we prove more, giving a uniformity 
property for such flows. If b is real > 0, then we denote by J, the open 
interval —b<t<b. 


Proposition 1.1. Let J be an open interval of R containing 0, and U 
open in the Banach space E. Let x, be a point of U, anda>0,a<1 
a real number such that the closed ball B3,(x9) lies in U. Assume that 
we have a continuous map 


f:JI x U-E 
which is bounded by a constant L = 1 on J x U, and satisfies a Lipschitz 
condition on U uniformly with respect to J, with constant K2 1. If 
b <a/LK, then for each x in B,(x9) there exists a unique flow 
a: J, x B,(xo) > U. 


If f is of class C” (p = 1), then so is each integral curve «,,. 


Proof. _Let I, be the closed interval —b <t <b, and let x be a fixed 
point in B,(x,). Let M be the set of continuous maps 


a: I, > B(x) 


of the closed interval into the closed ball of center x and radius 2a, such 
that «(0) =x. Then M is a complete metric space if we define as usual 
the distance between maps a, B to be 


sup |a(t) — B(O). 
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We shall now define a mapping 
S:M>M 


of M into itself. For each a in M, we let Sa be defined by 


(Sa)(t) = x + | “Teen van 
0 


Then Sa is certainly continuous, we have Sa(0) = x, and the distance of 
any point on Sa from x is bounded by the norm of the integral, which is 
bounded by 


bsup|f(u, y)| S bL <a. 


Thus Sa lies in M. 
We contend that our map S is a shrinking map. Indeed, 


|Sa — SB| < b sup! f(u, au) — f(u, B))| 
< bK\a — Bi, 


thereby proving our contention. 

By the shrinking lemma (Chapter I, Lemma 5.1) our map has a unique 
fixed point a, and by definition, «(t) satisfies the desired integral relation. 
Our remark above concludes the proof. 


Corollary 1.2. The local flow « in Proposition 1.1 is continuous. Fur- 
thermore, the map x +> a, of B,(xo) into the space of curves is continu- 
ous, and in fact satisfies a Lipschitz condition. 


_ Proof. The second statement_obviously implies the first. So fix x in 
B,(x,) and take y close to x in B,(xo). We let S, be the shrinking map of 
the theorem, corresponding to the initial condition x. Then 
Ilor,. _ S,%5[l = || S,.0, _ Syl Ss |x yl. 
Let C=bK so0<C<1. Then 
lla, — Sforg|| S lloee — Syael| + IS,a. — Sya|| + -+° + [|SPta, — Sy, 
<(1+C+-°°+C"")|x — yl. 
Since the limit of S/w, is equal to a, as n goes to infinity, the continuity 


of the map x + «, follows at once. In fact, the map satisfies a Lipschitz 
condition as stated. 
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It is easy to formulate a uniqueness theorem for integral curves over 
their whole domain of definition. 


Theorem 1.3 (Uniqueness Theorem). Let U be open in E and let 
f:U—-E be a vector field of class C’, p21. Let 


a,:J,>U and a2: J,>U 


be two integral curves for f with the same initial condition xy. Then a, 
and a, are equal on J, 0 Jp. 


Proof. Let Q be the set of numbers b such that «,(t) = a,(t) for 
O<st<b. 


Then Q contains some number b > 0 by the local uniqueness theorem. If 
Q is not bounded from above, the equality of «,(t) and «,(t) for all t>0 
follows at once. If Q is bounded from above, let b be its least upper 
bound. We must show that 5 is the right end point of J; 0 J,. Suppose 
that this is not the case. Define curves B, and f, near 0 by 


Bi(t)}=a(b+t) and = B,(t)=4,(b + 2). 


Then 8, and £, are integral curves of f with the initial conditions «,(b) 
and «,(b) respectively. The values f,(t) and £,(t) are equal for small 
negative t because b is the least upper bound of Q. By continuity it 
follows that «,(b) = a,(b), and finally we see from the local uniqueness 
theorem that 


B,(t) = B,(t) 


for all t in some neighborhood of 0, whence a, and a, are equal in a 
neighborhood of b, contradicting the fact that b is a least upper bound of 
Q. We can argue the same way towards the left end points, and thus 
prove our statement. 


For each x € U, let J(x) be the union of all open intervals containing 0 
on which integral curves for f are defined, with initial condition equal to 
x. The uniqueness statement allows us to define the integral curve uniquely 
on all of J(x). 


Remark. The choice of 0 as the initial time value is made for conve- 
nience. From the uniqueness statement one obtains at once (making a 
time translation) the analogous statement for an integral curve defined on 
any open interval; in other words, if J,, J, do not necessarily contain 0, 
and t, is a point in J, \J, such that «,(to) = a (to), and also we have 
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the differential equations 
a()= f(a) and — a(t) = f(a2(0), 
then a, and «, are equal on J, NJ. 
In practice, one meets vector fields which may be time dependent, and 


also depend on parameters. We discuss these to show that their study 
reduces to the study of the standard case. 


Time-dependent vector fields 
Let J be an open interval, U open in a Banach space E, and 
f:J x U-E 


a C? map, which we view as depending on time te J. Thus for each ¢, 
the map x + f(t, x) is a vector field on U. Define 


f:I x USRxE 
by 7 
f(t, x) = (1, f@ x), 


and view f as a time-independent vector field on J x U. Let & be its 
flow, so that 


a’(t, s,x) = f(x(t, s,x)), =(0, s, x) =(s, x). 
We note that & has its values in J x U and thus can be expressed in 
terms of two components. In fact, it follows at once that we can write & 
in the form 
a(t, s, x) =(t + s, %(t, s, x)). 
Then &, satisfies the differential equation 
D,&2(t, S, x) i f(t + S, a(t, S, x)) 
as we see from the definition of f. Let 


Bt x) = X(t, 0, x). 


Then 8 is a flow for f, that is B satisfies the differential equation 


D, Blt, x) = S(t, BC, x), B(O, x) = x. 
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Given x € U, any value of t such that « is defined at (t, x) is also such 
that % is defined at (t, 0, x) because «, and f, are integral curves of the 
same vector field, with the same initial condition, hence are equal. Thus 
the study of time-dependent vector fields is reduced to the study of 
time-independent ones. 


Dependence on parameters 
Let V be open in some space F and let 
giJxVxU—-E 


be a map which we view as a time-dependent vector field on U, also 
depending on parameters in V. We define 


G:JxVxU->FxE 
by 
G(t, z, y) = (0, g(t, z, y)) 


for te J, ze V, and ye U. This is now a time-dependent vector field on 
V x U. A local flow for G depends on three variables, say f(t, z, y), with 
initial condition f(0, z, y) = (z, y). The map B has two components, and it 
is immediately clear that we can write 


B(t, z, y) = (z, a(t, z, y)) 


for some map a depending on three variables. Consequently « satisfies 
the differential equation 


D, a(t, Z, y) = g(t, Z, a(t, Zz, y)), a(0, Z, y) =); 


which gives the flow of our original vector field g depending on the 
parameters zeV. This procedure reduces the study of differential 
equations depending on parameters to those which are independent of 
parameters. 


We shall now investigate the behavior of the flow with respect to its 
second argument, i.e. with respect to the points of U. We shall give two 
methods for this. The first depends on approximation estimates, and the 
second on the implicit mapping theorem in function spaces. 

Let Jy be an open subinterval of J containing 0, and let 


g: Jog > U 
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be of class C'. We shall say that @ is an c-approximate solution of f on 
lr'®-f(t pO) Se 


for all t in Jo. 


Proposition 1.4. Let @, and ~, be two &,- and &,-approximate solutions 
of f on Jo respectively, and let ¢ = &, + &. Assume that f is Lipschitz 
with constant K on U uniformly in Jy, or that D,f exists and is 
bounded by K on J x U. Let ty be a point of Jy. Then for any t in 
Jo, we have 


= é os 
11) — G2(0)| S 11(to) — Pr(to)leXe*l + mo 


Proof. By assumption, we have 


ni) — f(t i) Se, 
lo) — f(t, 20) S &2- 


From this we get 


loi) — 92(0) + f(t g2(0) — f(t gO) Se. 
Say t = ty to avoid putting bars around t — t . Let 


W(t) = |¢,() — e2(0)|, 
w(t) = |f(t, e() — f(t o2(0))- 


Then, after integrating from t) to t, and using triangle inequalities we 
obtain 


I(t) — Wito)] S et — to) + | o(u) du 
Set—to)+K { w(u) du 
<K | " LW(w) + e/K] du, 


and finally the recurrence relation 


W(t) S Wo) + K | [yy(u) + &/K] du. 
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On any closed subinterval of Jo, our map w is bounded. If we add é/K 
to both sides of this last relation, then we see that our proposition will 
follow from the next lemma. 


Lemma 1.5. Let g be a positive real valued function on an interval, 
bounded by a number L. Let to be in the interval, say to <t, and 
assume that there are numbers A, K = 0 such that 


gt) SA+K [ g(u) du. 


to 


Then for all integers n = 1 we have 


ae al) pA oa es or] | EKMt = to)" 


1! (n — 1)! n! 

Proof. The statement is an assumption for n=1. We proceed by 
induction. We integrate from ty to t, multiply by K, and use the recur- 
rence relation. The statement with n+ 1 then drops out of the statement 
with n. 


Corollary 1.6. Let f: J x U-E be continuous, and satisfy a Lipschitz 
condition on U uniformly with respect to J. Let xq be a point of U. 
Then there exists an open subinterval Jo of J containing 0, and an open 
subset of U containing xq such that f has a unique flow 


a:Jg X UU. 


We can select Jy and Uy such that « is continuous and satisfies a 
Lipschitz condition on Jo x Uo. 


Proof. Given x, y in Uy we let @,(t) = a(t, x) and ¢,(t) = a(t, y), using 
Proposition 1.6 to get Jy and Up. Then e,=¢,=0. For s, t in Jy we 
obtain 

Joe(t, x) — a(s, y)| S la(t, x) — a(t, y)| + late, y) — als, y)| 


< |x —yle* + |t—sl|L, 


if we take Jo of small length, and L is a bound for f. Indeed, the term 
containing |x — y| comes from Proposition 1.4, and the term containing 
|t — s| comes from the definition of the integral curve by means of an 
integral and the bound L for f. This proves our corollary. 


Corollary 1.7. Let J be an open interval of R containing 0 and let U be 
open in E. Let f: J x U-E be a continuous map, which is Lipschitz 
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on U uniformly for every compact subinterval of J. Let ty ¢€J and let 
(1, Q2 be two morphisms of class C* such that @,(to) = (to) and 
satisfying the relation 


ot) = f(t, e() 
for all tin J. Then ¢,(t) = 9,(t). 
Proof. We can take ¢ = 0 in the proposition. 


The above corollary gives us another proof for the uniqueness of 
integral curves. Given f:J x U-E as in this corollary, we can define 
an integral curve « for f on a maximal open subinterval of J having a 
given value a(t.) for a fixed ty in J. Let J be the open interval (a, b) and 
let (dg, bg) be the interval on which « is defined. We want to know when 
by = b (or dy = a), that is when the integral curve of f can be continued 
to the entire interval over which f itself is defined. 

There are essentially two reasons why it is possible that the integral 
curve cannot be extended to the whole domain of definition J, or cannot 
be extended to infinity in case f is independent of time. One possibility 
is that the integral curve tends to get out of the open set U, as on the 
following picture: 


This means that as t approaches bo, say, the curve a(t) approaches a 
point which does not lie in U. Such an example can actually be con- 
structed artificially. If we are in a situation when a curve can be ex- 
tended to infinity, just remove a point from the open set lying on the 
curve. Then the integral curve on the resulting open set cannot be 
continued to infinity. The second possibility is that the vector field is 
unbouned. The next corollary shows that these possibilities are the only 
ones. In other words, if an integral curve does not tend to get out of the 
open set, and if the vector field is bounded, then the curve can be 
continued as far as the original data will allow a priori. 


Corollary 1.8. Let J be the open interval (a, b) and let U be open in E. 
Let f: J x U-E be a continuous map, which is Lipschitz on U, uni- 
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formly for every compact subset of J. Let « be an integral curve of f, 
defined on a maximal open subinterval (ag, bo) of J. Assume: 


(i) There exists ¢ > 0 such that «((bp — &, bo)) is contained in U. 
(ii) There exists a number B>0 such that |f(t, «(t))| < B for all t in 
(bo — & bo). 


Then bo = b. 


Proof. From the integral expression for «, namely 


a(t) = a(to) + [ S(u, a(u)) du, 


we see that for t,, t, in (bp — &, by) we have 
jo(t,) — a(t2)| S$ Blt, — ty]. 
From this it follows that the limit 


lim «(t) 


tbo 


exists, and is equal to an element x9 of U (by hypothesis (i)). Assume 
that by 4 b. By the local existence theorem, there exists an integral curve 
B of f defined on an open interval containing by such that B(bo) = xo 
and f’(t) = f(t, B(d). Then B’ =a’ on an open interval to the left of bo, 
and hence a, f differ by a constant on this interval. Since their limit as 
t— by are equal, this constant is 0. Thus we have extended the domain 
of definition of « to a larger interval, as was to be shown. 


The next proposition describes the solutions of linear differential equa- 
tions depending on parameters. 


Proposition 1.9. Let J be an open interval of R containing 0, and let V 
be an open set in a Banach space. Let E be a Banach space. Let 


g:J x V>L(E, E) 
be a continuous map. Then there exists a unique map 
A: J x V—> L(E, E) 
which, for each x € V, is a solution of the differential equation 
D, A(t, x) = g(t, x)A(t, x), A(0, x) = id. 


This map A is continuous. 


[IV, §1] EXISTENCE THEOREM FOR DIFFERENTIAL EQUATIONS 75 


Remark. In the present case of a linear differential equation, it is not 
necessary to shrink the domain of definition of its flow. Note that the 
differential equation is on the space of continuous linear maps. The 
corresponding linear equation on E itself will come out as a corollary. 


Proof of Proposition 1.9. Let us first fix x eV. Consider the differ- 
ential equation 


D, A(t, x) = g(t, x)A(t, x), 


with initial condition A(0, x) = id. This is a differential equation on L(E, E), 
where f(t, z) = g,(t)z for z € L(E, E), and we write g,(t) instead of g(t, x). 
Let the notation be as in Corollary 1.8. Then hypothesis (i) is automati- 
cally satisfied since the open set U is all of L(E, E). On every compact 
subinterval of J, g, is bounded, being continuous. Omitting the index x 
for simplicity, we have 


t 
A(t) = id + | g(u)A(u) du, 
ce) 
whence for t = 0, say, 


|AQH| S1+B i. |A(u)| du. 


Using Lemma 1.5, we see that hypothesis (ii) of Corollary 1.8 is also 
satisfied. Hence the integral curve is defined on all of J. 

We shall now prove the continuity of A. Let (to, x9)e J x V. Let I be 
a compact interval contained in J, and containing tp and 0. As a func- 
tion of t, A(t, Xx) is continuous (even differentiable). Let C >0 be such 
that |A(t, x9)| < C for all te J. Let V, be an open neighborhood of x, in 
V such that g is bounded by a constant K > 0 on I x Vj. 

For (t, x) eI x V,; we have 


A(t, x) — A(Los Xo)| S [AG x) — A(t, Xo)] + IA(E X0) — Alo: Xo). 


The second term on the right is small when t is close to to. We investi- 
gate the first term on the right, and shall estimate it by viewing A(t, x) 
and A(t, X9) as approximate solutions of the differential equation satisfied 
by A(t, x). We find 
|D, A(t, Xo) — g(t, X)A(t, Xo)| 
= |D, A(t, Xo) — g{t, X)A(L Xo) + g(t, Xo)AL, Xo) — Gt Xo) AL Xo) 
S |g(t, Xo) — g(t, XIIIAG Xo) S lg, Xo) — g(t XJIC. 
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By the usual proof of uniform continuity applied to the compact set 
I x {xo}, given ¢>0, there exists an open neighborhood Vy of x con- 
tained in V,, such that for all (t, x) eI x Vo we have 


lg(t, x) — g(t, Xo)| < eC. 


This implies that A(t, xo) is an é-approximate solution of the differential 
equation satisfied by A(t, x). We apply Proposition 1.4 to the two curves 


Po(t) = A(t, Xo) and @,(t) = A(t, x) 


for each xe Vy. We use the fact that A(0, x) = 10, x9) = id. We then 
find 
|A(t, x) — A(t, Xo)| < Ky 


for some constant K, > 0, thereby proving the continuity of 4 at (to, Xo). 


Corollary 1.10. Let the notation be as in Proposition 1.9. For each 
x €V and ze E the curve 


Bit, x, z) = Alt, x)z 


with initial condition B(0,x,z)=z is a solution of the differential 
equation 
D, Bit, x, z) = g(t, x)B(t, x, 2). 


Furthermore, B is continuous in its three variables. 
Proof. Obvious. 


Theorem 1.11 (Local Smoothness Theorem). Let J be an open interval 
in R containing 0 and U open in the Banach space E. Let 


f:J x U-E 


be a C’-morphism with p= 1, and let x»eU. There exists a unique 
local flow for f at x9. We can select an open subinterval Jy of J 
containing 0 and an open subset U, of U containing xq such that the 
unique local flow 


a:Jg xX Up > U 


is of class C”, and such that D,« satisfies the differential equation 


D,D a(t, x) = Dz f(t, a(t, x))D2a(t, x) 


on Jy x Up with initial condition D,«(0, x) = id. 
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Proof. Let 
g.J x U>L&, E) 


be given by g(t, x) =D,f(t, a(t, x)). Select J, and Uy such that a is 
bounded and Lipschitz on J, x Uy (by Corollary 1.6), and such that g is 
continuous and bounded on J, x Uj. Let Jo be an open subinterval of 
J, containing 0 such that its closure Jy is contained in J,. 
Let A(t, x) be the solution of the differential equation on L(E, E) given 
by 
D, A(t, x) = g(t, X)A(t, x), A(0, x) = id, 


as in Proposition 1.9. We contend that D,a exists and is equal to 4 on 
Jo x Uy. This will prove that D,« is continuous, on Jy x Up. 
Fix xe U). Let 


A(t, h) = a(t, x + h) — a(t, x). 
Then 
D, 6(t, h) = Dy a(t, x + h) — D, a(t, x) 
= f(t, a(t, x + h)) — f(t, a(t, x)). 


By the mean value theorem, we obtain 


|D, A(t, h) — g(t, x) A(t, h)| 
= | f(t, a(t, x + h)) — f(t, alt, x)) — Do f(z, a(t, x)) A(t, h)| 
Ss |h|sup|D, f(t, y) 7 Dp f(t, a(t, x))| 
where y ranges over the segment between a(t, x) and a(t,x +h). By the 
compactness of Jp it follows that our last expression is bounded by 


|h|yW(h) where W(h) tends to 0 with h, uniformly for t in Jy. Hence we 
obtain 


|O'(t, h) — g(t, x)O(t, h)| S |hl wh), 


for all t in Jy. This shows that 0(t, h) is an |h|w(h) approximate solution 
for the differential equation satisfied by A(t, x)h, namely 


D, A(t, x)h — g(t, x)A(t, x)h = 0, 
with the initial condition 1(0, x)h =h. We note that @(t, h) has the same 
initial condition, 6(0, h) = h. Taking tp = 0 in Proposition 1.4, we obtain 


the estimate 


|O(t, h) — A(t, x)h| S Cy |hl Wh) 
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for all t in Jo. This proves that D,« is equal to 4 on Jo x Up, and is 
therefore continuous on Jp x Up. 

We have now proved that D,a and D,« exist and are continuous on 
Jy x Up, and hence that a is of class C' on Jy x Up. 

Furthermore, D,« satisfies the differential equation given in the state- 
ment of our theorem on J, x Up. Thus our theorem is proved when 
p=. 


A flow which satisfies the properties stated in the theorem will be 
called locally of class C’. 


Consider now again the linear equation of Proposition 1.9. We re- 
formulate it to eliminate formally the parameters, namely we define a 
vector field 


G:J x V x L(E, E)> F x L(E, E) 
to be the map such that 
G(t, x, w) = (0, g(t, x)a) 


for w € L(E, E). The flow for this vector field is then given by the map A 
such that 


A(t, x, @) = (x, A(t, x)oo). 


If g is of class C1 we can now conclude that the flow A is locally of class 
C', and hence putting w = id, that / is eres of class C}. 

We apply this to the case when g(t, x) = D,f(t, a(t, x)), and to the 
solution D,« of the differential equation 


D, (D2) (t, x) oe gt, x)D, a(t, x) 


locally at each point (0, x), xe U. Let p22 be an integer and assume 
our theorem proved up to p — 1, so that we can assume a locally of class 
C’-!, and f of class C’. Then g is locally of class C?~', whence D,« is 
locally C?~*. From the expression 


D, a(t, x) = f(t, a(t, x)) 
we conclude that D,a is C?~!, whence « is locally C?. 


If f is C*®, and if we knew that « is of class C? for every integer p on 
its domain of definition, then we could conclude that « is C®; in other 
words, there is no shrinkage in the inductive application of the local 
theorem. We shall do this at the end of the section. 
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We shall now give another proof for the local smoothness of the flow, 
which depends on a simple application of the implicit mapping theorem 
in Banach spaces, and was found independently by Pugh and Robbin 
[Ro 68]. One advantage of this proof is that it extends to H? vector 
fields, as noted by Ebin and Marsden [EbM 70]. 

Let U be open in E and let f: U +E be a C? map. Let b> 0 and let 
I, be the closed interval of radius b centered at 0. Let 


F = C°%(I,, E) 


be the Banach space of continuous maps of I, into E. We let V be the 
subset of F consisting of all continuous curves 


o:I1,7U 


mapping J, into our open set U. Then it is clear that V is open in F 
because for each curve o the image o(I,) is compact, hence at a finite 
distance from the complement of U, so that any curve close to it is also 
contained in U. 
We define a map 
T:UxV->F 
by 


Teo)=xt | foors 


i) 


Here we omit the dummy variable of integration, and x stands for the 
constant curve with value x. If we evaluate the curve T(x, o) at t, then 
by definition we have 


T(x, o)(t) =x + | f(o(u)) du — o(t). 
0 


Lemma 1.12. The map T is of class C?, and its second partial deriva- 
tive is given by the formula 


ates 0)= | Df oo —I 
10) 
where I is the identity. In terms of t, this reads 
t 
D, T(x, a)h(t) = | Df(a(u))h(u) du — h(t). 
0 


Proof. It is clear that the first partial derivative D,T exists and is 
continuous, in fact C®, being linear in x up to a translation. To deter- 
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mine the second partial, we apply the definition of the derivative. The 
derivative of the map a+» a is of course the identity. We have to get 
the derivative with respect to o of the integral expression. We have for 
small h 


|| rowe+in—| soo-| (oreo 
) ) 0 


s | Met +W—foo-(f ooh 


We estimate the expression inside the integral at each point u, with u 
between 0 and the upper variable of integration. From the mean value 
theorem, we get 


|f(a(u) + h(u)) — f(o(u)) — Df(o(u))h)| [Ih | sup|Df(z,) — Df(o))| 


where the sup is taken over all points z, on the segment between o(u) 
and o(u)+h(u). Since Df is continuous, and using the fact that the 
image of the curve o(I,) is compact, we conclude (as in the case of 
uniform continuity) that as ||h|| > 0, the expression 


sup|Df(z,) — Df(o(u))| 


also goes to 0. (Put the ¢ and 6 in yourself.) By definition, this gives us 
the derivative of the integral expression in o. The derivative of the final 
term is obviously the identity, so this proves that D,T is given by the 
formula which we wrote down. 

This derivative does not depend on x. It is continuous in o. Namely, 
we have 


D, T(x, t) — D, T(x, 6) = | [Df ot — Df oo]. 
0 


If o is fixed and 7 is close to a, then Df ot — Df oo is small, as one 
proves easily from the compactness of o(J,), as in the proof of uniform 
continuity. Thus D,T is continuous. By Proposition 3.5 of Chapter I, 
we now conclude that T is of class C’. 

The derivative of D,T with respect to o can again be computed as 
before if Df is itself of class C1, and thus by induction, if f is of class C? 
we conclude that D,T is of class C?~' so that by the same reference, we 
conclude that T itself is of class C?. This proves our lemma. 


We observe that a solution of the equation 


T(x, 0) =0 
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is precisely an integral curve for the vector field, with initial condition 
equal to x. Thus we are in a situation where we want to apply the 
implicit mapping theorem. 


Lemma 1.13. Let x, ¢U. Let a> 0 be such that Df is bounded, say by 
a number C, > 0, on the ball B,(x9) (we can always find such a since Df 
is continuous at X9). Let b <1/C,. Then D,T(x, oc) is invertible for all 
(x, 0) in B,(X9) x V. 


Proof. We have an estimate 


| I. Df (o(u))h(u) du} < BC, Ihll. 


This means that 
|\D, T(x, o) + I| < 1, 


and hence that D,T(x, 0) is invertible, as a continuous linear map, thus 
proving Lemma 1.13. 


We are ready to reprove the local smoothness theorem by the present 
means, when p is an integer, namely: 


Theorem 1.14. Let p be a positive integer, and let f: UE be a C? 
vector field. Let Xy¢€U. Then there exist numbers a, b >0 such that 
the local flow 

a: J, x B,(xo) ~ U 
is of class C?. 


Proof. We take a so small and then b so small that the local flow 
exists and is uniquely determined by Proposition 1.1. We then take b 
smaller and a smaller so as to satisfy the hypotheses of Lemma 1.13. We 
can then apply the implicit mapping theorem to conclude that the map 
xt+a, is of class C?. Of course, we have to consider the flow a and still 
must show that « itself is of class C?. It will suffice to prove that D,a 
and D,« are of class C?~', by Proposition 3.5 of Chapter I. We first 
consider the case p = 1. 

We could derive the continuity of « from Corollary 1.2 but we can 
also get it as an immediate consequence of the continuity of the map 
xt+a,. Indeed, fixing (s, y) we have 


|a(t, x) ~ a(s, y)I = |a(t, x) a a(t, yl + |o(t, y) ace a(s, yl 
s \|,. ae Oy + |a,(t) 7% at, (s)|. 


Since a, is continuous (being differentiable), we get the continuity of «. 
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Since 


D, a(t, x) = f(a(t, x), 
we conclude that D,« is a composite of continuous maps, whence 


continuous. 
Let @ be the derivative of the map x b «,, so that 


@: B,(Xo) > L(E, C°(I,, E)) = L(E, F) 
is of class C?-*. Then 
Ox+w — &, = p(x)w + [wl y(w), 
where w(w) > 0 as w— 0. Evaluating at t, we find 
a(t, x + w) — a(t, x) = (p(w) + Iwly(w)(O), 


and from this we see that 


Dz a(t, x)w = (p(x)w) (0). 
Then 


|D, a(t, x)w yas D,«(s, y)w| 
S l(o(X)w) — (9) w)O1 + (E(w) 0 — (@0)w)(5)- 


The first term on the right is bounded by 


le(x) — e(y)I|wl 
so that 
|D, a(t, x) — D a(t, y)| S |e(x) — e(y)|. 


We shall prove below that 


I(o(y)w)® — (@)w)(s)| 


is uniformly small with respect to w when s is close to t. This proves the 
continuity of D,«, and concludes the proof that « is of class C?. 


The following proof that |(g(y)w)(t) — (g(y)w)(s)| is uniformly small 
was shown to be by Professor Yamanaka. We have 


(1) a(t, x) =x + icc x)) du. 
0 
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Replacing x with x + Aw (we E, A £0), we obtain 

t 
(2) a(t, x + Aw) =x + Aw | f(a(u, x + Aw)) du. 

0) 


Therefore 


a(t, x + Aw) — a(t, x) - 


3) ; 


w+ [. us (a(u, x + Aw)) — f(a(u, x))] du. 
0 


On the other hand, we have already seen in the proof of Theorem 1.14 
that 


(4) a(t, x + Aw) — a(t, x) = A(p(x)w)(t) + [All wl paw). 


Substituting (4) in (3), we obtain: 
(oteyeyie + iyi =w + [FEAatu x + 299) — flotu 3) 


=wt ‘i | G(u, 4, v) du du, 
0oJ0 
where 
G(u, A, v) = Df(a(u, x) + ve1(4))((p(x)w)(u) + €2(A)) 


€1(A) = A(g(x)w)(u) + |Allwiyw)(u), — e2(A) = A yanytu. 


with 


Letting 1 +0, we have 


6) (otsywyte) = w + [ dylata 9) (ots)w) (9d 
By (5) we have 
Kocer»)( — (ocars)on s | [D/C 39) (ots) 0 du 
S bC,|e(x)|-|wl-|t — sl, 
from which we immediately obtain the desired uniformity. 


Returning to our main concern, the flow, we have 


a(t, x) =x + {. f(a(u, x)) du. 
0 
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We can differentiate under the integral sign with respect to the parameter 
x and thus obtain 


D, a(t, x) = 1+ [, Df(a(u, x))D,«(u, x) du, 
0 


where I is a constant linear map (the identity). Differentiating with 
respect to t yields the linear differential equation satisfied by D,«, namely 


D,D,a(t, x) = Df(a(t, x))D2«(t, x) 


and this differential equation depends on time and parameters. We have 
seen earlier how such equations can be reduced to the ordinary case. We 
now conclude that locally, by induction, D,« is of class C?~* since Df is 
of class C?~'. Since 


D, a(t, x) = f(a(t, x)), 


we conclude by induction that D,« is C?~'. Hence a is of class C? by 
Proposition 3.5 of Chapter I. Note that each time we use induction, the 
domain of the flow may shrink. We have proved Theorem 1.14, when p 
is an integer. 


We now give the arguments needed to globalize the smoothness. We 
may limit ourselves to the time-independent case. We have seen that the 
time-dependent case reduces to the other. 

Let U be open in a Banach space E, and let f: U >E be a C? vector 
field. We let J(x) be the domain of the integral curve with initial condi- 
tion equal to w. 

Let D(f) be the set of all points (t,x) in R x U such that ¢ lies in 
J(x). Then we have a map 


a: D(f)-U 
defined on all of D(f), letting a(t, x) = a,(t) be the integral curve on J(x) 
having x as initial condition. We call this the flow determined by /, and 
we call D(f) its domain of definition. 
Lemma 1.15. Let f: UE be a C? vector field on the open set U of 
E, and let a be its flow. Abbreviate a(t, x) by tx, if (t,x) is in the 
domain of definition of the flow. Let x €U. If to lies in J(x), then 


J (tox) = J(x) — to 
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(translation of J(x) by —to), and we have for all t in J(x) — to: 
U(tox) = (t + to)x. 
Proof. The two curves defined by 
trea(t,a(to,x)) and treat +to,x) 


are integral curves of the same vector field, with the same initial condi- 
tion tyx at t= 0. Hence they have the same domain of definition J(t)x). 
Hence t¢, lies in J(tox) if and only if t, + to lies in J(x). This proves the 
first assertion. The second assertion comes from the uniqueness of the 
integral curve having given initial condition, whence the theorem follows. 


Theorem 1.16 (Global Smoothness of the Flow). If f is of class C? 
(with p S 00), then its flow is of class C? on its domain of definition. 


Proof. First let p be an integer = 1. We know that the flow is locally 
of class C? at each point (0, x), by the local theorem. Let x € U and let 
J(Xo) be the maximal interval of definition of the integral curve having 
Xo as initial condition. Let D(f) be the domain of definition of the flow, 
and let a be the flow. Let Q be the set of numbers b > 0 such that for 
each t with 0 <t <b there exists an open interval J containing t and an 
open set V containing x9 such that J x V is contained in D(f) and such 
that a is of class C? on Jx V. Then Q is not empty by the local 
theorem. If Q is not bounded from above, then we are done looking 
toward the right end point of J(xo). If Q is bounded from above, we let 
b be its least upper bound. We must prove that b is the right end point 
of J(x). Suppose that this is not the case. Then a(b, x9) is defined. Let 
X, = a(b, Xo). By the local theorem, we have a unique local flow at x,, 
which we denote by f: 


B: J, x B,(x;) ms U, BO, x) =X, 
defined for some open interval J, =(—a,a) and open ball B,(x,) of ra- 
dius a centered at x,. Let 5 be so small that whenever b—6<t <b we 


have 
a(t, Xo) € Byg(x1). 


We can find such 6 because 


lim a(t, Xo) = x, 
t>b 


by continuity. Select a point t, such that b— 6 <t, <b. By the hypoth- 
esis on b, we can select an open interval J, containing t, and an open set 
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U, containing xX, so that 
a: Jy x Uy > Byp(x1) 


maps J, x U, into B,(x,). We can do this because « is continuous at 
(t,, Xo), being in fact C? at this point. If |t—t,|<a and xeU,, we 
define 


v(t, x) = B(t — ty, a(t,, x)). 
Then 


(ty, x) = B(O, a(t, x)) — a(ty, x) 
and 
D, v(t, x) = D, Blt _ t,, a(t, , x)) 


= f(Blt — t, a(t,, »))) 
= f(o(t, »)). 


Hence both ¢, and «a, are integral curves for f with the same value at t,. 
They coincide on any interval on which they are defined by the unique- 
ness theorem. If we take 6 very small compared to a, say 6 < a/4, we see 
that @ is an extension of a to an open set containing (t,,%,), and also 
containing (b, xq). Furthermore, g is of class C”, thus contradicting the 
fact that b is strictly smaller than the end point of J(x,). Similarly, one 
proves the analogous statement on the other side, and we therefore see 
that D(f) is open in R x U and that « is of class C? on D(f), as was to 
be shown. 


The idea of the above proof is very simple geometrically. We go as 
far to the right as possible in such a way that the given flow « is of class 
C? locally at (t,x). At the point a(b, x9) we then use the flow f to 
extend differentiably the flow « in case b is not the right-hand point of 
J(X9). The flow B at a(b, x.) has a fixed local domain of definition, and 
we simply take t close enough to b so that B gives an extension of a, as 
described in the above proof. 

Of course, if f is of class C®, then we have shown that a is of class C? 
for each positive integer p, and therefore the flow is also of class C®. 

In the next section, we shall see how these arguments globalize even 
more to manifolds. 


IV, §2. VECTOR FIELDS, CURVES, AND FLOWS 


Let X be a manifold of class C? with p22. We recall that X is 
assumed to be Hausdorff. Let 2: T(X)— X be its tangent bundle. Then 
T(X) is of class C? 1, p> 1. 
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By a (time-independent) vector field on X we mean a cross section of 
the tangent bundle, ie. a morphism (of class C?~') 


é:X > T(X) 


such that &(x) lies in the tangent space T,(X) for each x € X, or in other 
words, such that zé = id. 

If T(X) is trivial, and say X is an E-manifold, so that we have a 
VB-isomorphism of T(X) with X x E, then the morphism € is completely 
determined by its projection on the second factor, and we are essentially 
in the situaiton of the preceding paragraph, except for the fact that our 
vector field is independent of time. In such a product representation, the 
projection of € on the second factor will be called the local representation 
of €. It is a C?~}-morphism 


f:X ~E 


and &(x) = (x, f(x)). We shall also say that € is represented by f locally if 
we work over an open subset U of X over which the tangent bundle 
admits a trivialisation. We then frequently use € itself to denote this 
local representation. 

Let J be an open interval of R. The tangent bundle of J is then 
J x R and we have a canonical section : such that i(t)= 1 for all te J. 
We sometimes write 7, instead of i(t). 

By a curve in X we mean a morphism (always of class 2 1 unless 
otherwise specified) 


a: J>aX 


from an open interval in R into X. If g: X > Y is a morphism, then go« 
is a curve in Y. From a given curve a, we get an induced map on the 
tangent bundles: 


JxR —* 1(X) 


| | 


J— x 


a 


and a,o1 will be denoted by « or by da/dt if we take its value at a 
point t in J. Thus o’ is a curve in T(X), of class C?~' if « is of class C?. 
Unless otherwise specified, it is always understood in the sequel that we 
start with enough differentiability to begin with so that we never end up 
with maps of class < 1. Thus to be able to take derivatives freely we 
have to take X and « of class C” with p = 2. 
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If g: X > Y is a morphism, then 
(g 0 a)'(t) = g4'(0). 


This follows at once from the functoriality of the tangent bundle and the 
definitions. 

Suppose that J contains 0, and let us consider curves defined on J 
and such that a(0) is equal to a fixed point x9. We could say that two 
such curves a,, #, are tangent at 0 if «,(0) = «,(0). The reader will verify 
immediately that there is a natural bijection between tangency classes of 
curves with a(0)= x, and the tangent space T,,(X) of X at xo. The 
tangent space could therefore have been defined alternatively by taking 
equivalence classes of curves through the point. 

Let € be a vector field on X and x9 a point of X. An integral curve 
for the vector field € with initial condition x., or starting at xo, is a curve 
(of class C?~!) 

a: J >X 


mapping an open interval J of R containing 0 into X, such that «(0) = xo 
and such that 


a(t) = E(a(t)) 


for all te J. Using a local representation of the vector field, we know 
from the preceding section that integral curves exist locally. The next 
theorem gives us their global existence and uniqueness. 


Theorem 2.1. Let «,: J, 2X and a,: J, +X be two integral curves of 
the vector field € on X, with the same initial condition xy. Then «, and 
a, are equal on J, OV Jp. 


Proof. Let J* be the set of points t such that «,(t) = «,(t). Then J* 
certainly contains a neighborhood of 0 by the local uniqueness theorem. 
Furthermore, since X is Hausdorff, we see that J* is closed. We must 
show that it is open. Let t* be in J* and define B,, B, near 0 by 


B,() = a, (t* + 0, 

B2(t) = a2(t* + 0). 
Then f£, and £, are integral curves of € with initial condition «,(t*) and 
a,(t*) respectively, so by the local uniqueness theorem, f, and f, agree 


in a neighborhood of 0 and thus «,, #, agree in a neighborhood of t*, 
thereby proving our theorem. 


It follows from Theorem 2.1 that the union of the domains of all 
integral curves of € with a given initial condition x9 is an open interval 
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which we denote by J(x 9). Its end points are denoted by t*(x,) and 
t” (x9) respectively. (We do not exclude +00 and —oo.) 

Let D(é) be the subset of R x X consisting of all points (t, x) such 
that 


t (x) <t < t*(x). 
A (global) flow for € is a mapping 
a: Df) X, 
such that for each x € X, the map a,: J(x) > X given by 
a(t) = a(t, x) 


defined on the open interval J(x) is a morphism and is an integral curve 
for é with initial condition x. When we select a chart at a point xq of X, 
then one sees at once that this definition of flow coincides with the 
definition we gave locally in the previous section, for the local representa- 
tion of our vector field. 

Given a point x e¢ X and a number t¢, we say that tx is defined if (¢, x) 
is in the domain of a, and we denote a(t, x) by tx in that case. 


Theorem 2.2. Let € be a vector field on X, and « its flows. Let x bea 
point of X. If to lies in J(x), then 


I (tox) = I(x) — to 
(translation of J(x) by —to), and we have for all t in J(x) — to: 
t(tox) — (t + to)x. 


Proof. Our first assertion follows immediately from the maximality 
assumption concerning the domains of the integral curves. The second is 
equivalent to saying that the two curves given by the left-hand side and 
right-hand side of the last equality are equal. They are both integral 
curves for the vector field, with initial condition tgx and must therefore 
be equal. 

In particular, if t;, tf; are two numbers such that t,x is defined and 
t,(t, x) is also defined, then so is (t, + t,)x and they are equal. 


Theorem 2.3. Let € be a vector field on X, and x a point of X. 
Assume that t*(x) < 00. Given a compact set A < X, there exists ¢ >0 
such that for all t>t*(x)—.«, the point tx does not lie in A, and 
similarly for t~. 
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Proof. Suppose such ¢ does not exist. Then we can find a sequence t, 
of real numbers approaching t*(x) from below, such that t,x lies in A. 
Since A is compact, taking a subsequence if necessary, we may assume 
that t,x converges to a point in A. By the local existence theorem, there 
exists a neighborhood U of this point y and a number 6 >0 such that 
t*(z) > 6 for all ze U. Taking n large, we have 


tt(x)< 6 +t, 
and t,x is in U. Then by Theorem 2.2, 


t* (x) = t°(t,x) +t, > 0 + t, > t*(x) 
contradiction. 


Corollary 2.4. If X is compact, and & is a vector field on X, then 
D(E)=Rx X. 


It is also useful to give one other criterion when D(é) = R x X, even 
when X is not compact. Such a criterion must involve some structure 
stronger than the differentiable structure (essentially a metric of some 
sort), because we can always dig holes in a compact manifold by taking 
away a point. 


Proposition 2.5. Let E be a Banach space, and X an E-manifold. Let & 
be a vector field on X. Assume that there exist numbers a>0O and 
K > 0 such that every point x of X admits a chart (U, @) at x such that 
the local representation f of the vector field on this chart is bounded by 
K, and so is its derivative f’. Assume also that @U contains a ball of 
radius a around ox. Then D(é)=R x X. 


Proof. This folows at once from the global continuation theorem, and 
the uniformity of Proposition 1.1. 


We shall prove finally that D(é) is open and that « is a morphism. 


Theorem 2.6. Let & be a vector field of class C?~+ on the C?-manifold 
X (23 pS). Then D(E) is open in R x X, and the flow a for & is a 
C?1-morphism. 


Proof. Let first p be an integer = 2. Let x» eX. Let J* be the set of 
points in J(x 9) for which there exists a number b>0O and an open 
neighborhood U of x, such that (t—b,t+b) U is contained in D(é), 
and such that the restriction of the flow « to this product is a C?™}- 
morphism. Then J* is open in J(x,), and certainly contains 0 by the 
local theorem. We must therefore show that J* is closed in J(xo). 
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Let s be in its closure. By the local theorem, we can select a neigh- 
borhood V of sx = a(s, x.) so that we have a unique local flow 


B:J,x Vox 


for some number a> 0, with initial condition B(0, x)= x for all xe V, 
and such that this local flow B is C?~?. 

The integral curve with initial condition x is certainly continuous on 
J(xo). Thus tx9 approaches sxo as t approaches s. Let V, be a given 
small neighborhood of sx, contained in V. By the definition of J*, we 
can find an element t, in J* very close to s, and a small number b 
(compared to a) and a small neighborhood U of x, such that « maps the 
product 

(t; —b,t; +b)x U 


into V,, and is C?~! on this product. For teJ,+t, and xeU, we 
define 


ot, x) me Blt — th, a(t, > x). 
Then o(t,, x) = B(0, a(t,, x)) = a(t,, x), and 


D, g(t, x) Dy B(t — ty, a(t, x)) 
= (Bt — ty, a(ty, x) 
= E((t, x)). 


Hence both 9,, a, are integral curves for €, with the same value at f,. 
They coincide on any interval on which they are defined, so that @, is a 
continuation of a, to a bigger interval containing s. Since « is C?™ on 
the product (t, — b,t, + b) x U, we conclude that g is also C?~! on 
(J, + t,) x U. From this we see that D(é) is open in R x X, and that « 
is of class C?~1 on its full domain D(é). If p= 00, then we can now 
conclude that « is of class C’ for each positive integer r on D(¢), and 
hence is C®, as desired. 


Corollary 2.7. For each teR, the set of xeX such that (t,x) is 
contained in the domain D(é) is open in X. 


Corollary 2.8. The functions t*(x) and t~(x) are upper and lower semi- 
continuous respectively. 


Theorem 2.9. Let € be a vector field on X and a its flow. Let D,(&) be 
the set of points x of X such that (t, x) lies in D(E). Then D,(E) is open 
for each t ER, and a, is an isomorphism of D,(€) onto an open subset of 
X. In fact, «,(D,) = D_, and a;' = a_,. 


Proof. Immediate from the preceding theorem. 
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Corollary 2.10. If x9 is a point of X and t is in J(xq), then there exists 
an open neighborhood U of Xo such that t lies in J(x) for all x € U, and 
the map 


xr {tx 


is an isomorphism of U onto an open neighborhood of tx. 


Critical points 


Let & be a vector field. A critical point of € is a point x, such that 
&(x9) = 0. Critical points play a significant role in the study of vector 
fields, notably in the Morse theory. We don’t go into this here, but just 
make a few remarks to show at the basic level how they affect the 
behavior of integral curves. 


Proposition 2.11. If « is an integal curve of a C' vector field, €, and a 
passes through a critical point, then a is constant, that is a(t)=Xp9 for 
all t. 


Proof. The constant curve through x, is an integral curve for the 
vector field, and the uniqueness theorem shows that it is the only one. 


Some smoothness of the vector field in addition to continuity must be 
assumed for the uniqueness. For instance, the following picture illustrates 
a situation where the integral curves are not unique. They consist in 
translations of the curve y = x? in the plane. The vector field is continu- 
ous but not locally Lipschitz. 
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Proposition 2.12. Let & be a vector field and « an integral curve for ¢. 
Assume that all t = 0 are in the domain of «a, and that 


lim a(t) = x, 
t>0 


exists. Then x, is a critical point for €, that is €(x,)=0. 


Proof. Selecting t large, we may assume that we are dealing with the 
local representation f of the vector field near x,. Then for t’ >t large, 
we have 


a(t’) — a(t) = | f(a(u)) du. 


Write f(a(u)) = f(x1) + gu), where lim g(u) = 0. Then 
|f(x,)||t’ — t| S |a(t’) — a(2)| + It’ — t|sup|g@)|, 


where the sup is taken for u large, and hence for small values of g(u). 
Dividing by |t’ — t| shows that f(x,) is arbitrarily small, hence equal to 
0, as was to be shown. 


Proposition 2.13. Suppose on the other hand that x is not a critical 
point of the vector field . Then there exists a chart at Xq such that the 
local representation of the vector field on this chart is constant. 


Proof. In an arbitrary chart the vector field has a representation as a 
morphism 


E:USE 


near X9. Let « be its flow. We wish to “straighten out” the integral 
curves of the vector field according to the next figure. 


F x F a(r(x), Px) 
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In other words, let v = (xo). We want to find a local isomorphism 9 at 
Xo such that 


gy (x)v = E(e(x)). 


We inspire ourselves from the picture. Without loss of generality, we 
may assume that x» =0. Let 4 be a functional such that A(v) #0. We 
decompose E as a direct sum 


E=F®@ORz2, 


where F is the kernel of A. Let P be the projection on F. We can write 
any x near 0 in the form 


x = Px + t(x)v, 
where 


We then bend the picture on the left to give the picture on the right 
using the flow a of €, namely we define 


g(x) = a(t(x), Px). 


This means that starting at Px, instead of going linearly in the direction 
of v for a time t(x), we follow the flow (integral curve) for this amount of 
time. We find that 

A 


g' (x) = D,a(z(x), Px) io) 


+ D,a(t(x), Px)P. 


Hence ¢’(0) = id, so by the inverse mapping theorem, g is a local iso- 
morphism at 0. Furthermore, since Pv = 0 by definition, we have 


@'(x)v = Dy a(t(x), Px) = &(9(2)), 


thus proving Proposition 2.13. 


IV, §3. SPRAYS 


Second-order vector fields and differential equations 


Let X be a manifold of class C? with p= 3. Then its tangent bundle 
T(X) is of class C’~', and the tangent bundle of the tangent bundle 
T(T(X)) is of class C?~?, with p—22> 1. 
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Let «a: JX be a curve of class C4 (q < p). A lifting of a into T(X) is 
a curve B: J > T(X) such that 7B =a. We shall always deal with q 2 2 
so that a lift will be assumed of class q— 121. Such lifts always exist, 
for instance the curve « discussed in the previous section, called the 
canonical lifting of «. 

A second-order vector field over X is a vector field F on the tangent 
bundle T(X) (of class C?~') such that, if x: TX — X denotes the canoni- 
cal projection of T(X) on X, then 


nt, oF = id., that is 2,F(v) =v for all v in T(X). 
Observe that the succession of symbols makes sense, because 
n,: TT(X) > T(X) 
maps the double tangent bundle into T(X) itself. 


A vector field F on TX is a second-order vector field on X if and only 
if it satisfies the following condition: Each integral curve B of F is 
equal to the canonical lifting of xB, in other words 


(By = B. 


Here, zB is the canonical projection of B on X, and if we put the 
argument t, then our formula reads 


(xB (t) = B(t) 


for all t in the domain of B. The proof is immediate from the definitions, 
because 


(xBy = 1,8 =1,°F of 


We then use the fact that given a vector ve TX, there is an integral 
curve B = B, with B,(0) = v (initial condition v). 

Let «: J > X be a curve in X, defined on an interval J. We define « 
to be a geodesic with respect to F if the curve 


a: J>TX 
is an integral curve of F. Since na’ =a, that is a’ lies above « in TX, we 
can express the geodesic condition equivalently by stating that « satisfies 
the relation 


x” = F(a’). 


This relation for curves « in X is called the second-order differential 
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equation for the curve «, determined by F. Observe that by definition, if 
B is an integral curve of F in TX, then mf is a geodesic for the second 
order vector field F. 

Next we shall give the representation of the second order vector field 
and of the integral curves in a chart. 


Representation in charts 


Let U be open in the Banach space E, so that T(U)=U x E, and 
T(T(U)) = (U x E) x (Ex E). Then x: U x EU is simply the projec- 
tion, and we have a commutative diagram: 


(U x E)x (Ex E) —% UxE 


| | 


UxE — U 


T 


The map z, on each fiber E x E is constant, and is simply the projection 
of E x E on the first factor E, that is 


Ty (X, v, U, W) = (x, Ud). 
Any vector field on U x E has a local representation 
f:UxE>ExE 


which has therefore two components, f = (f,, f,), each f; mapping U x E 
into E. The next statement describes second order vector fields locally in 
the chart. 


Let U be open in the Banach space E, and let T(U) =U x E be the 
tangent bundle. A C?~*-morphism 


f:UxESEXxE 
is the local representation of a second order vector field on U if and 
only if 

f(x, v) io (v, fr(x, v)). 


The above statement is merely making explicit the relation 2, F = id, 
in the chart. If we write f = (f,, f,), then we see that 


Si (x, v) = v. 
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We express the above relations in terms of integral curves as follows. 
Let B = f(t) be an integral curve for the vector field F on TX. In the 
chart, the curve has two components 


B(t) = (x(0), v(t) EU x E. 


By definition, if f is the local representation of F, we must have 


d dx d 
7 = ($. =) = F(x, v) = (v, falx, v)). 


Consequently, our differential equation can be rewritten in the following 
manner: 
dx 


‘dt = v(t), 


d*x dv _ f(x dx 
ae dt 777 dt? 
which is of course familiar. 


Sprays 


We shail be interested in special kinds of second-order differential equa- 
tions. Before we discuss these, we make a few technical remarks. 

Let s be a real number, and z: E> X be a vector bundle. If v is in E, 
so in E, for some x in X, then sv is again in E, since E, is a vector 
space. We write s, for the mapping of E into itself given by this scalar 
multiplication. This mapping is in fact a VB-morphism, and even a 
VB-isomorphism if s #0. Then 


T (sg) = (Sz)4: T(E) > T(E) 


is the usual induced map on the tangent bundle of E. 
Now let E= TX be the tangent bundle itself. Then our map sry 
satisfies the property 


(Srx)e ° Srrx = Srrx ° (Srx)> 


which follows from the linearity of s;, on each fiber, and can also be 
seen directly from the representation on charts given below. 

We define a spray to be a second-order vector field which satisfies the 
homogeneous quadratic condition: 


SPR 1. For all se R and ve T(X), we have 


F (sv) = (Srx),SF(v). 
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It is immediate from the conditions defining sprays (second-order vec- 
tor field satisfying SPR 1) that sprays form a convex set! Hence if we 
can exhibit sprays over open subsets of Banach spaces, then we can glue 
them together by means of partitions of unity, and we obtain at once the 
following global existence theorem. 


Theorem 3.1. Let X be a manifold of class C? (p23). If X admits 
partitions of unity, then there exists a spray over X. 


Representations in a chart 
Let U be open in E, so that TU = U x E. Then 
TTU =(U x E) x (Ex B), 


and the representations of sry and (Sry), in the chart are given by the 
maps 


Sty: (X, v) (x, sv) and (Stuy: (X, 0, U, W) H (Xx, Sv, U, SW). 


Thus 
Srru © (Stu) g: (%& V, U, W) H (x, sv, su, s?w). 


We may now give the local condition for a second-order vector field F 
to be a spray. 


Proposition 3.2. In a chart Ux E for TX, let f:U x E>Ex E rep- 
resent F, with f =(f,, f2). Then f represents a spray if and only if, for 
all se R we have 


falx, sv) = s7 f(x, v). 


Proof. The proof follows at once from the definitions and the formula 
giving the chart representation of s(srx),. 


Thus we see that the condition SPR 1 (in addition to being a second- 
order vector field), simply means that f, is homogeneous of degree 2 in 
the variable v. By the remark in Chapter I, §3, it follows that f, is a 
quadratic map in its second variable, and specifically, this quadratic map 
is given by 

fr(x, v) = 3D3 f(x, O)(v, v). 


Thus the spray is induced by a symmetric bilinear map given at each 
point x in a chart by 


B(x) = 3D3 f(x, 0). 
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Conversely, suppose given a morphism 
U>L2,,(E,E) given by = x +> B(x) 


from U into the space of symmetric bilinear maps E x E> E. Thus for 
each v, we E the value of B(x) at (v,w) is denoted by B(x; v, w) or 
B(x)(v, w). Define 


Sox, v) = B(x; v, v). 


Then f, is quadratic in its second variable, and the map f defined by 


F(x, v) = (v, B(x; v, v)) = (v, fa v)) 


represents a spray over U. We call B the symmetric bilinear map asso- 
ciated with the spray. 

We recall the trivial fact from linear algebra that the bilinear map B is 
determined purely algebraically from the quadratic map, by the formula 


Biv, w) = 3L AW + w) — Av) — Aw)]. 


We have suppressed the x from the notation to focus on the relevant 
second variable v. Thus the quadratic map and the symmetric bilinear 
map determine each other uniquely. 

The above discussion has been local, over an open set U in a Banach 
space. In Proposition 3.4 and the subsequent discussion of connections, 
we show how to globalize the bilinear map B intrinsically on the mani- 
fold. 


Examples. As a trivial special case, we can always take f,(x, v) = (v, 0) 
to represent the second component of a spray in the chart. 

In the chapter on Riemannian metrics, we shall see how to construct a 
spray in a natural fashion, depending on the metric. 

In the chapter on covariant derivatives we show how a spray gives 
rise to such derivatives. 


Next, let us give the transformation rule for a spray under a change of 
charts, i.e. an isomorphism 


h:U-V. 
On TU, the map Th is represented by a morphism (its vector compo- 


nent) 
H:UxE>EXxE given by H(x, v) = (h(x), h’(x)v). 
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We then have one further lift to the double tangent bundle TTU, and we 
may represent the diagram of maps symbolically as follows: 


(Ux E)x (Ex ER @), (Vx =) x (Ex BE) 


VP sean LP 


UxE ase U2 VxE 
k oa | 
U ——— V 


Then the derivative H’(x, v) is given by the Jacobian matrix operating on 
column vectors (u, w) with u, we E, namely 


oyu ( 2 9 ol Va tO 2 
ae = (roe ) Bi ey (n= (in rea) (a): 


Thus the top map on elements in the diagram is given by 
(H, H’): (x, v, u, w) + (h(x), h’(x)v, h’(x)u, h"(x)(u, v) + h'(x)w). 


For the application, we put u = v because f,(x, v) = v, and w = fy, (x, 0), 
where fy and f, denote the representations of the spray over U and V 
respectively. It follows that fy and f, are related by the formula 


S(A(x), h'(x)v) = (h'(x)v, bh" (x)(v, v) + h'(x)fu,2(%, v)). 
Therefore we obtain: 


Proposition 3.3. Change of variable formula for the quadratic part of a 
spray: 
Fv,2(ho), h’'(x)v) = h" (x), v) +h’) fu, a(x, 0), 


By (h(x); h’(x)v, h’(x)w) = h"(x)(v, w) + h'(x)By(x; v, w). 
Proposition 3.3 admits a converse: 


Proposition 3.4. Suppose we are given a covering of the manifold X by 
open sets corresponding to charts U, V, ..., and for each U we are 
given a morphism 

By: U > L3,,,(E, E) 


which transforms according to the formula of Proposition 3.3 under an 
isomorphism h: U-—V. Then there exists a unique spray whose asso- 
ciated bilinear map in the chart U is given by By. 


Proof. We leave the verification to the reader. 
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Remarks. Note that By(x;v, w) does not transform like a tensor of 
type L2,,(E, E), ie. a section of the bundle L2a(TX, TX). There are 
several ways of defining the bilinear map B intrinsically. One of them is 
via second order bundles, or bundles of second order jets, and to extend 
the terminology we have established previously to such bundles, and even 
higher order jet bundles involving higher derivatives, as in [Po 62]. 
Another way will be done below, via connections. For our immediate 
purposes, it suffices to have the above discussion on second-order differ- 
ential equations together with Proposition 3.3 and 3.4. I used sprays (as 
recommended by Palais) in the earliest version of this book [La 62]. 
In [Lo 69] the bilinear map By is expressed in terms of second order 
jets. For applications to symmetric spaces, see [He 78] (expanded ver- 
sion of the similar book from 1962), and [Lo 69]. See also [Pa 57] for 
early Lie group applications. The basics of differential topology and 
geometry were being established in the early sixties. Cf. the bibliograph- 
ical notes from [Lo 69] at the end of his first chapter. 


Connections 


We now show how to define the bilinear map B intrinsically and directly. 
Matters will be clearer if we start with an arbitrary vector bundle 


p:iE>~X 


over a manifold X. As it happens we also need the notion of a fiber 
bundle when the fibers are not necessarily vector spaces, so don’t have a 
linear structure. Let f: Y > X be a morphism. We say that f (or Y over 
X) is a fiber bundle if f is surjective, and if each point x of X has an 
open neighborhood U, and there is some manifold Z and an isomor- 
phism h: f-1(U) > U x Z such that the following diagram is commuta- 
tive: 


jp" pee 
ie 


Thus locally, f: Y— X looks like the projection from a product space. 
The reason why we need a fiber bundle is that the tangent bundle 


Ty: TEE 


is a vector bundle over E, but the composite f = poz,;: TEX is only 
a fiber bundle over X, a fact which is obvious by picking trivializations 
in charts. Indeed, if U is a chart in X, and if U x F-U is a vector 
bundle chart for E, with fiber F, and Y = TE, then we have a natural 
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isomorphism of fiber bundles over U: 


fu) —————_ (Ux F)x (Ex F) 


Tg Pri2 


Note that U being a chart in X implies that U x EU is a vector 
bundle chart for the tangent bundle TU over U. 

The tangent bundle TE has two natural maps making it a vector 
bundle: 


t,: TE > E is a vector bundle over E; 


T(p): TE > TX is a vector bundle over TX. 


Therefore we have a natural morphism of fiber bundle (not vector bun- 
dle) over X: 


(nz, T(p)): TE*>E@TX ~~ givenby Wr>(n,W,T(p)W) 
for We TE. If We TE with ee E,, then n,We E,, and T(p)We T,X. 
After these preliminaries, we define a connection to be a morphism of 
fiber bundles over X, from the direct sum E @ TX into TE: 
H:E®TX-TE 


such that 
(nz, T(p)) oH = idgerx, 


and such that H is bilinear, in other words H,: E, ® T,X — TE is bilinear. 
Consider a chart U as in the above diagram, so 


TU=UxE and T(U x F)=(U x F) x (Ex F). 
Then our map H has a coordinate representation 
H(x, e, v) = (x, e, Hy (x, e, v), Hy(x,e,v)) for ee F and ve E. 


The fact that (zg, T(p)) 0 H = idggrx implies at once that H,(x, e, v) = v. 
The bilinearity condition implies that for fixed x, the map 


(e, v) r> H,(x, é, v) 
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is bilinear as a map F x EE. We shall therefore denote this map by 
B(x), and we write in the chart 


H(x, e, v) = (x, e, v, B(x)(e, v)) or also (x, e, v, B(x, e, v)). 


Now take the special case when E = TX. We say that the connection 
is symmetric if the bilinear map B is symmetric. Suppose this is the case, 
then we may define the corresponding quadratic map TX ~ TTX by 


Sa(x, v) = B(x, v, v). 
Globally, this amounts to defining a morphism 
F: TX ~ TTX such that F = H o diagonal 
where the diagonal is taken in TX @ TX, in each fiber. Thus 
F(v) = H(v, v) for ve T,X. 


Then F is a vector field on TX, and the condition (z,,7,)°H = id on 
TX ®TX implies that F is a second-order vector field on X, in other 
words, F defines a spray. It is obvious that all sprays can be obtained in 
this fashion. Thus we have shown how to describe geometrically the 
bilinear map associated with a spray. 

Going back to the general case of a vector bundle E unrelated to TX, 
we note that the image of a connection H is a vector subbundle over E. 
Let V denote the kernel of the map T(p): TE> TX. We leave it to the 
reader to verify in charts that V is a vector subbundle of TE over E, and 
that the image of H is a complementary subbundle. One calls V the 
vertical subbundle, canonically defined, and one calls H the horizontal 
subbundle determined by the connection. 


IV, §4. THE FLOW OF A SPRAY AND 
THE EXPONENTIAL MAP 


The condition we have taken to define a spray is equivalent to other 
conditions concerning the integral curves of the second-order vector field 
F. We shall list these conditions systematically. We shall use the follow- 
ing relation. If a: J—+X is a curve, and a, is the curve defined by 
a,(t) = a(st), then 

a’, (t) = sa’(st), 


this being the chain rule for differentiation. 
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If v is a vector in TX, let B, be the unique integral curve of F with 
initial condition v (i.e. such that B,(0) =v). In the next three conditions, 
the sentence should begin with “for each v in TX”. 


SPR 2. A number t is in the domain of B,, if and only if st is in the 
domain of B, and then 


Bs(t) = sB,(st). 


SPR 3. If s, t are numbers, st is in the domain of B, if and only if s is 
in the domain of B,,, and then 


7B,.(s) = ™B,(st). 


SPR 4. A number t is in the domain of B, if and only if 1 is in the 
domain of B,,, and then 


B,(t) = 7B,(1). 
We shall now prove the equivalence between all four conditions. 


Assume SPR 1, and let s be fixed. For all t such that st is in the 
domain of B,, the curve B,(st) is defined and we have 


© (sBa(st)) = 5ySB4(3t) = 545F(By(st) = F(sB,(0). 


Hence the curve sf,(st) is an integral curve for F, with initial condition 
sB,(0) = sv. By uniqueness we must have 


sB,(st) = B,,(t). 
This proves SPR 2. 
Assume SPR 2. Since £, is an integral curve of F for each v, with 
initial condition v, we have by definition 
<v(0) = F(sv). 


Using our assumption, we also have 


Prlt) = 5 (sB,(st) = 5458465 


Put t= 0. Then SPR 1 follows because f,, and B, are integral curves of 
F with initial conditions sv and v respectively. 
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It is obvious that SPR 2 implies SPR 3. Conversely, assume SPR 3. 
To prove SPR 2, we have 


Brat) = (Bax) () = 5 nBy(st) = s(,) (st) = $B(0 


which proves SPR 2. 

Assume SPR 4. Then st is in the domain of B, if and only if 1 is in 
the domain of B,,,, and s is in the domain of 8,, if and only if 1 is in the 
domain of £,,,. This proves the first assertion of SPR 3, and again by 
SPR 4, assuming these relations, we get SPR 3. 

It is similarly clear that SPR 3 implies SPR 4. 


Next we consider further properties of the integral curves of a spray. 
Let F be a spray on X. As above, we let 8, be the integral curve with 
initial condition v. Let D be the set of vectors v in T(X) such that £, is 
defined at least on the interval [0,1]. We know from Corollary 2.7 that 
D is an open set in T(X), and by Theorem 2.6 the map 


v + B,(1) 
is a morphism of D into T(X). We now define the exponential map 


exp: D— X 
to be 
exp(v) = 7B,(1). 


Then exp is a C?-*-morphism. We also call D the domain of the expo- 
nential map (associated with F). 

If x e X and 0, denotes the zero vector in T,, then from SPR 1, taking 
s = 0, we see that F(0,) = 0. Hence 


exp(0,.) = x. 


Thus our exponential map coincides with x on the zero cross section, 
and so induces an isomorphism of the cross section onto X. It will be 
convenient to denote the zero cross section of a vector bundle E over X 
by ¢,(X) or simply ¢X if the reference to E is clear. Here, E is the 
tangent bundle. 

We denote by exp, the restriction of exp to the tangent space T.. 
Thus 


exp,: TI. > X. 
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Theorem 4.1. Let X be a manifold and F a spray on X. Then 
exp,: T. > X 


induces a local isomorphism at 0,, and in fact (exp,), is the identity 
at 0,.. 


Proof. We prove the second assertion first because the main assertion 
follows from it by the inverse mapping theorem. Furthermore, since T, is 
a vector space, it suffices to determine the derivative of exp, on rays, in 
other words, to determine the derivative with respect to t of a curve 
exp,(tv). This is done by using SPR 3, and we find 


d 
a oe = Bro 


Evaluating this at t = 0 and taking into account that f,, has w as initial 
condition for any w gives us 


(exp,),,(0,) = id. 


This concludes the proof of Theorem 4.1. 


Helgason gave a general formula for the differential of the exponential 
map on analytic manifolds [He 61], reproduced in [He 78], Chapter I, 
Theorem 6.5. We shall study the differential of the exponential map in 
connection with Jacobi fields, in Chapter IX, §2. 

Next we describe all geodesics. 


Proposition 4.2. The images of straight segments through the origin in 
T,,, under the exponential map exp,, are geodesics. In other words, if 
v é T, and we let 


a(v, t) = a(t) = exp,(tv), 


then «, is a geodesic. Conversely, let a: J -+ X be a C* geodesic defined 
on an interval J containing 0, and such that «(0)=x. Let «'(0) =v. 
Then a(t) = exp,,(tv). 


Proof. The first statement by definition means that a, is an integral 
curve of the spray F. Indeed, by the SPR conditions, we know that 


a(v, t) = o,(t) = B,(1) = 2B,(t), 


and (zf,)’ = B, is indeed an integral curve of the spray. Thus our asser- 
tion that the curves t +> exp(tv) are geodesics is obvious from the defini- 
tion of the exponential map and the SPR conditions. 
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Conversely, given a geodesic a: J— X, by definition a’ satisfies the 
differential equation 


x”(t) = F(a’(t)). 


The two curves t+> a(t) and tt» exp,(tv) satisfy the same differential 
equation and have the same initial conditions, so the two curves are 
equal. This proves the second statement and concludes the proof of the 
proposition. 


Remark. From the theorem, we note that a C! curve in X is a 
geodesic if and only if, after a linear reparametrization of its interval of 
definition, it is simply t + exp,(tv) for some x and some v. 


We call the map (v, t) a(v, t) the geodesic flow on X. It is defined 
on an open subset of TX x R, with a(v,0)=x if ve T,X. Note that 
since n(sB,(t)) = 7B,(t) for se R, we obtain from SPR 2 the property 


a(sv, t) = a(v, st) 


for the geodesic flow. Precisely, t is in the domain of a,, if and only if st 
is in the domain of a,, and in that case the formula holds. As a slightly 
more precise version of Theorem 4.1 in this light, we obtain: 


Corollary 4.3. Let F be a spray on X, and let x)¢X. There exists 
an open neighborhood U of xo, and an open neighborhood V of 
0,, in TX satisfying the following condition. For every xeU and 
ve Vo T,X, there exists a unique geodesic 


a,:(—2, 2) +X 
such that 
a,(0) = x and a,(0) = v. 


Observe that in a chart, we may pick V as a product 
V=UxV,0)CUXxE 


where V,(0) is a neighborhood of 0 in E. Then the geodesic flow is 
defined on U x V,(0) x J, where J = (—2, 2). We picked (—2, 2) for con- 
creteness. What we really want is that 0 and 1 lie in the interval. Any 
bounded interval J containing 0 and 1 could have been selected in the 
statement of the corollary. Then of course, U and V (or V,(0)) depend 
on J. 
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IV, §5. EXISTENCE OF TUBULAR NEIGHBORHOODS 


Let X be a submanifold of a manifold Y. A tubular neighborhood of X in 
Y consists of a vector bundle 2: E— X over X, an open neighborhood Z 
of the zero section (,X in E, and an isomorphism 


f[:Z-U 


of Z onto an open set in Y containing X, which commutes with ¢: 


IX 


J 


We shall call f the tubular map and Z or its image f(Z) the corre- 
sponding tube (in E or Y respectively). The bottom map j is simply the 
inclusion. We could obviously assume that it is an embedding and define 
tubular neighborhoods for embeddings in the same way. We shall say 
that our tubular neighborhood is total if Z=E. In this section, we 
investigate conditions under which such neighborhoods exist. We shall 
consider the uniqueness problem in the next section. 


Theorem 5.1. Let Y be of class C? (p= 3) and admit partitions of 
unity. Let X be a closed submanifold. Then there exists a tubular 
neighborhood of X in Y, of class C?~?. 


Proof. Consider the exact sequence of tangent bundles: 
0> T(X)> T(Y)|X > N(X)> 0. 
We know that this sequence splits, and thus there exists some splitting 
T(Y)|X = T(X) @ N(X) 


where N(X) may be identified with a subbundle of T(Y)|X. Following 
Palais, we construct a spray € on T(Y) using Theorem 3.1 and obtain the 
corresponding exponential map. We shall use its restriction to N(X), 
denoted by exp|N. Thus 


exp|N: DA N(X)- Y. 


We contend that this map is a local isomorphism. To prove this, we 
may work locally. Corresponding to the submanifold, we have a product 
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decomposition U = U, x U,, with X = U, x 0. If U is open in E, then 
we may take U,, U, open in F,, F, respectively. Then the injection of 
N(X) in T(Y)|X may be represented locally by an exact sequence 


0>U, x F,—>U, x F, x Fy, 
and the inclusion of T(Y)|X in T(Y) is simply the inclusion 
U, x F, x F, > U, x U, x F, x Fy. 


We work at the point (x,,0) in U, x F,. We must compute the deriva- 
tive of the composite map 


U, x F, > U, x U, x F, x F, > Y 


at (x,,0). We can do this by the formula for the partial derivatives. 
Since the exponential map coincides with the projection on the zero cross 
section, its “horizontal” partial derivative is the identity. By Theorem 4.1 
we know that its “vertical” derivative is also the identity. Let 


W =(exp)o@ 
(where @ is simply @ followed by the inclusion). Then for any vector 
(w,,w2) in F, x F, we get 


Di (x, ? 0): (w,, W2) > (wy > 0) + Px,(W2), 


where ,, is the linear map given by g on the fiber over x,. By 
hypothesis, we know that F, x F, is the direct sum of F, x 0 and of the 
image of g,,. This proves that Dy(x,, 0) is a toplinear isomorphism, and 
in fact proves that the exponential map restricted to a normal bundle is a 
local isomorphism on the zero cross section. 

We have thus shown that there exists a vector bundle E > X, an open 
neighborhood Z of the zero section in E, and a mapping f: Z > Y which, 
for each x in C,, is a local isomorphism at x. We must show that Z can 
be shrunk so that f restricts to an isomorphism. To do this we follow 
Godement ([God 58], p. 150). We can find a locally finite open covering 
of X by open sets U; in Y such that, for each i we have inverse isomor- 
phisms 

fi: Z;,7 U; and 9;: U; > Z; 


between U; and open sets Z; in Z, such that each Z; contains a point x 
of X, such that f;, g; are the identity on X (viewed as a subset of both Z 
and Y) and such that f; is the restriction of f to Z;. We now find a 
locally finite covering {V;} of X by open sets of Y such that Vc U,, and 
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let V=|) VY. We let W be the subset of elements ye V such that, if y 
lies in an intersection V1 V,, then g(y) = g,(y). Then W certainly con- 
tains X. We contend that W contains an open subset containing X. 

Let xe X. There exists an open neighborhood G, of x in Y which 
meets only a finite number of V,, say V,,, ...,V;,. Taking G, small enough, 
we can assume that x lies in each one of these, and that G, is contained 
in each one of the sets U;,, ...,U;,,. Since x lies in each V,, ...,V,, it is 
contained in U;,,...,U;, and our maps g;,, ...,g;, take the same value at 
x, namely x itself. Using the fact that f,,, ...,f, are restrictions of f, we 
see at once that our finite number of maps g;,, ...,g;,, must agree on G, if 
we take G, small enough. 

Let G be the union of the G,. Then G is open, and we can define a 
map 


g.Gog(QcZ 


by taking g equal to g; on GOV. Then g(G) is open in Z, and the 
restriction of f to g(G) is an inverse for g. This proves that f, g are 
inverse isomorphisms on G and g(G), and concludes the proof of the 
theorem. 


A vector bundle E— X will be said to be compressible if, given an 
open neighborhood Z of the zero section, there exists an isomorphism 


g. EZ, 


of E with an open subset Z, of Z containing the zero section, which 
commutes with the projection on X: 


E——*—_. z, 


So 


It is clear that if a bundle is compressible, and if we have a tubular 
neighborhood defined on Z, then we can get a total tubular neighbor- 
hood defined on E. We shall see in the chapter on Riemannian metrics 
that certain types of vector bundles are compressible (Hilbert bundles, 
assuming that the base manifold admits partitions of unity). 


IV, §6. UNIQUENESS OF TUBULAR NEIGHBORHOODS 


Let X, Y be two manifolds, and F:R x X > Y a morphism. We shall 
say that F is an isotopy (of embeddings) if it satisfies the following 
conditions. First, for each te R, the map F, given by F,(x) = F(t, x) is an 
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embedding. Second, there exist numbers ty < t, such that F, = F,, for all 
t < ty and F,, = F, for all t>t,. We then say that the interval [to, t, ] is 
a proper domain for the isotopy, and the constant embeddings on the left 
and right will also be denoted by F_,, and F,,, respectively. We say that 
two embeddings f: X > Y and g: X > Y are isotopic if there exists an 
isotopy F, as above such that f = F,, and g =F, (notation as above). 
We write f x g for f isotopic to g. 

Using translations of intervals, and multiplication by scalars, we can 
always transform an isotopy to a new one whose proper domain is 
contained in the interval (0,1). Furthermore, the relation of isotopy 
between embeddings is an equivalence relation. It is obviously symmet- 
ric and reflexive, and for transitivity, suppose fx g and g xh. We can 
choose the ranges of these isotopies so that the first one ends and stays 
constant at g before the second starts moving. Thus it is clear how to 
compose isotopies in this case. 

If sy) <s, are two numbers, and o:R—R is a function (morphism) 
such that o(s) = tp for s Ss and o(s) =t, for s2=s,, and o is monotone 
increasing, then from a given isotopy F, we obtain another one, G, = Fy. 
Such a function o can be used to smooth out a piece of isotopy given 
only on a closed interval. 


Remark. We shall frequently use the following trivial fact: If f,: X - Y 
is an isotopy, and if g: X, > X and h: Y— Y, are two embeddings, then 
the composite map 

hf.g: X, >, 
is also an isotopy. 


Let Y be a manifold and X a submanifold. Let 1: E— X be a vector 
bundle, and Z an open neighborhood of the zero section. An isotopy 
f,:Z—Y of open embeddings such that each f, is a tubular neighbor- 
hood of X will be called an isotopy of tubular neighborhoods. In what 
follows, the domain will usually be all of E. 


Proposition 6.1. Let X be a manifold. Let 1: E- X and 1,: E, > X be 
two vector bundles over X. Let 


f:E-E, 


be a tubular neighborhood of X in E, (identifying X with its zero 
section in E,). Then there exists an isotopy 


fi EOE, 


with proper domain [0,1] such that f, = f and fy is a VB-isomorphism. 
(If f, 2, 1 are of class C” then f, can be chosen of class C?~'.) 
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Proof. We define F by the formula 


F,(e) = t*fi(te) 


for t~40 and ee E. Then F, is an embedding since it is composed of 
embeddings (the scalar multiplications by t, t™! are in fact VB-isomor- 
phism). 

We must investigate what happens at t = 0. 

Given ee E, we find an open neighborhood U, of me over which E, 
admits a trivialization U, x E,. We then find a still smaller open neigh- 
borhood U of ze and an open ball B around 0 in the typical fiber E of 
E such that E admits a trivialization U x E over U, and such that the 
representation f of f on U x B (contained in U x E) maps U x B into 
U, x E,. This is possible by continuity. On U x B we can represent f 
by two morphisms, 


F(x, v) = (9(x, 0), WO, v)) 


and (x, 0) = x while w(x, 0) = 0. Observe that for all t sufficiently small, 
te is contained in U x B (in the local representation). 
We can represent F, locally on U x B as the mapping 


F(x, v) = (g(x, tv), W(x, t0)). 


The map 9 is then a morphism in the three variables x, v, and t even at 
t= 0. The second component of F, can be written 

1 
tw (x, tv) = t+ | D,wW(x, stv): (tv) ds 


0 


and thus t~’ cancels t to yield simply 


1 
| D W(x, stv): v ds. 
0 


This is a morphism in t, even at t = 0. Furthermore, for t = 0, we obtain 
Fo(x, v) = (x, Di (x, 0)v). 


Since f was originally assumed to be an embedding, it follows that 
D,(x, 0) is a toplinear isomorphism, and therefore Fy is a VB-isomor- 
phism. To get our isotopy in standard form, we can use a function 
o:R-—R such that o(t)=0 for t<0 and o(t)=1 for t21, and o is 
monotone increasing. This proves our proposition. 
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Theorem 6.2. Let X be a submanifold of Y. Let 
tm: EX and 1,:£, 7X 


be two vector bundles, and assume that E is compressible. Let f: E— Y 
and g:E,—Y be two tubular neighborhoods of X in Y. Then there 
exists a C?~-isotopy 


fiEoyY 


of tubular neighborhoods with proper domain [0,1] and a VB-isomor- 
phism A: E> E, such that f, = f and fy = gi. 


Proof. We observe that f(E) and g(E,) are open neighborhoods of X 
in Y. Let U= f(f(E)Og(E,)) and let g: EU be a compression. Let 
Ww be the composite map 


9 


E— div, 


U Y 

Ww =(f|U)og. Then wy is a tubular neighborhood, and w(E) is contained 
in g(E,). Therefore gw: E— E, is a tubular neighborhood of the same 
type considered in the previous proposition. There exists an isotopy of 
tubular neighborhoods of X: 


G,: E> E, 


such that G, = g-‘w and Gp is a VB-isomorphism. Considering the isotopy 
gG,, we find an isotopy of tubular neighborhoods 


WEY 


such that ¥, =w and )=gw where w: EE, is a VB-isomorphism. 
We have thus shown that w and gw are isotopic (by an isotopy of 
tubular neighborhoods). Similarly, we see that w and fu are isotopic for 
some VB-isomorphism 


MESE., 


Consequently, adjusting the proper domains of our isotopies suitably, we 
get an isotopy of tubular neighborhoods going from gw to fu, say F,. 
Then F,u7} will give us the desired isotopy from gw to f, and we can 
put 2 = wu! to conclude the proof. 


(By the way, the uniqueness proof did not use the existence theorem 
for differential equations.) 


CHAPTER V 


Operations on Vector Fields 
and Differential Forms 


If E—X is a vector bundle, then it is of considerable interest to investi- 
gate the special operation derived from the functor “multilinear alternat- 
ing forms.” Applying it to the tangent bundle, we call the sections of our 
new bundle differential forms. One can define formally certain relations 
between functions, vector fields, and differential forms which lie at the 
foundations of differential and Riemannian geometry. We shall give the 
basic system surrounding such forms. In order to have at least one 
application, we discuss the fundamental 2-form, and in the next chapter 
connect it with Riemannian metrics in order to construct canonically the 
spray associated with such a metric. 

We assume throughout that our manifolds are Hausdorff, and suffi- 
ciently differentiable so that all of our statements make sense. 


V, §1. VECTOR FIELDS, DIFFERENTIAL 
OPERATORS, BRACKETS 


Let X be a manifold of class C? and @ a function defined on an open set 
U, that is a morphism 
g: UR. 


Let & be a vector field of class C?~!. Recall that 
T.¢: T(U) > TAR) =R 


is a continuous linear map. With it, we shall define a new function to be 
denoted by €g or &(g). (There will be no confusion with this notation 
and composition of mappings.) 
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Proposition 1.1. There exists a unique function Ep on U of class C?™! 
such that 


(Cp)(x) = (Te) E(x). 


If U is open in the Banach space E and & denotes the local representa- 
tion of the vector field on U, then 


(Ce) (x) = G'(x)E(x). 


Proof. The first formula certainly defines a mapping of U into R. The 
local formula defines a C?~!-morphism on U. It follows at once from 
the definitions that the first formula expresses invariantly in terms of the 
tangent bundle the same mapping as the second. Thus it allows us to 
define €p as a morphism globally, as desired. 


Let %? denote the ring of functions (of class C?). Then our operation 
ot &@ gives rise to a linear map 


de: B(U) > F *(U), defined by 6.9 = €@. 


A mapping 
é:R>S 


from a ring R into an R-algebra S is called a derivation if it satisfies the 
usual formalism: Linearity, and 6(ab) = ad(b) + d(a)b. 


Proposition 1.2. Let X be a manifold and U open in X. Let & bea 
vector field over X. If 6, =0, then &(x) =0 for all xe U. Each 6, is a 
derivation of &?(U) into §?~1(U). 


Proof. Suppose (x) #0 for some x. We work with the local repre- 
sentations, and take g to be a continuous linear map of E into R such 
that g(&(x)) 4 0, by Hahn—Banach. Then g’(y) = 9 for all ye U, and we 
see that o’(x)é(x) £0, thus proving the first assertion. The second is 
obvious from the local formula. 


From Proposition 1.2 we deduce that if two vector fields induce the 
same differential operator on the functions, then they are equal. 

Given two vector fields €, 4 on X, we shall now define a new vector 
field [€, 7], called their bracket product. 


Proposition 1.3. Let &, 4 be two vector fields of class C?~1 on X. Then 


there exists a unique vector field [E,n] of class C?~? such that for each 
open set U and function @ on U we have 


[é, nlo = E(n(e)) — n(E(9)). 
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If U is open in E and €, n are the local representations of the vector 
fields, then [&, 4] is given by the local formula 


LE, nl ox) = 9'(x)(n'()E(~) — En). 


Thus the local representation of [é,] is given by 


£2, 1] (x) = n'(x)e(x) — o'(x) n(x). 


Proof. By Proposition 1.2, any vector field having the desired effect 
on functions is uniquely determined. We check that the local formula 
gives us this effect locally. Differentiating formally, we have (using the 
law for the derivative of a product): 


(neyé — (Ce)'n = (g'n)'E — (@'E)n 
='n'E + png — oon — pen. 


The terms involving g” must be understood correctly. For instance, the 
first such term at a point x is simply 


0" (x)(n(~), &() 


remembering that g”(x) is a bilinear map, and can thus be evaluated at 
the two vectors n(x) and (x). However, we know that o”(x) is symmet- 
ric. Hence the two terms involving the second derivative of @ cancel, and 
give us our formula. 


Corollary 1.4. The bracket [é, 7] is bilinear in both arguments, we have 
[é, 1] = —[n, €], and Jacobi’s identity 


Cé, La, €]] + Un, 06, 61] + é £6, 1]] = 0. 


If @ is a function, then 


[¢, on] = (Se)n + eLé, n], 
Lee, n] = eLé, n] — (n@)é. 


Proof. The first two assertions are obvious. The third comes from the 
definition of the bracket. We apply the vector field on the left of the 
equality to a function g. All the terms cancel out (the reader will write 
it out as well or better than the author). The last two formulas are 
immediate. 
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We make some comments concerning the functoriality of vector fields. 
Let 
f[:X oY 


be an isomorphism. Let & be a vector field over X. Then we obtain an 
induced vector field f,€ over Y, defined by the formula 


(fe EF (x) = TH(E()). 
It is the vector field making the following diagram commutative: 


tx 4. ry 


ll lle 


We shall also write f* for (f~'), when applied to a vector field. Thus 
we have the formulas 


LEH Te goe and FH Tf oof. 


If f is not an isomorphism, then one cannot in general define the 
direct or inverse image of a vector field as done above. However, let ¢ 
be a vector field over X, and let 4 be a vector field over Y. If for each 
xe X we have 


TF(E() = n( fF), 
then we shall say that f maps € into y, or that € and yn are f-related. If 
this is the case, then we may denote by f,& the map from f(X) into TY 
defined by the above formula. 


Let €,, €2 be vector fields over X, and let n,, yn, be vector fields over 
Y. If &, is f-related to n; for i= 1, 2 then as maps on f(X) we have 


Sela; C2] = [n, N21. 


We may write suggestively the formula in the form 


Fal1, 62) = Cfeas fy So]: 
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Of course, this is meaningless in general, since f,¢, may not be a vector 
field on Y. When f is an isomorphism, then it is a correct formulation of 
the other formula. In any case, it suggests the correct formula. 

To prove the formula, we work with the local representations, when 
X = U is open in E, and Y = V is open in F. Then é;, y; are maps of U, 
V into the spaces E, F respectively. For x € X we have 


(Fal Es, 21) (2) = fC) (Fa) Ex(%) — €40)E2(0)). 


On the other hand, by assumption, we have 


n( F(x) = f'(X)E) 
so that 
Eni, no (f(x)) = 02(£0))n (FC) — 1 (F@))n2(F0) 
= nl fO)F OE) — mi (SO) F'()Er() 
= (N20 SY (EL) — (mr © f('() E209) 
= f'"()* Sa(x) S10) + £0) S9(%) C3 () 
— f° (x): Fs (x) Sox) — F610) E2(%). 


Since f”(x) is symmetric, two terms cancel, and the remaining two terms 
give the same value as (f,.[¢,, €.])(x), as was to be shown. 


The bracket between vector fields gives an infinitesimal criterion for 
commutativity in various contexts. We give here one theorem of a gen- 
eral nature as an example of this phenomenon. 


Theorem 1.5. Let &, 4 be vector fields on X, and assume that [é, 4] = 
0. Let a and B be the flows for € and n respectively. Then for real 
values t, s we have 


a, 0 B, = Blo &%. 
Or in other words, for any x € X we have 
a(t, B(s, x)) = B(s, a(t, x)), 
in the sense that if for some value of t a value of s is in the domain of 


one of these expressions, then it is in the domain of the other and the 
two expressions are equal. 


Proof. For a fixed value of t, the two curves in s given by the 
right- and left-hand side of the last formula have the same initial condi- 


[V, §1] VECTOR FIELDS, DIFFERENTIAL OPERATORS, BRACKETS 119 


tion, namely «,(x). The curve on the right 
st+ B(s, a(t, x)) 

is by definition the integral curve of 7. The curve on the left 
st a(t, B(s, x)) 


is the image under «, of the integral curve for 4 having initial condition 
x. Since x is fixed, let us denote f(s, x) simply by f(s). What we must 
show is that the two curves on the right and on the left satisfy the same 
differential equation. 


B(s, a(t, x)) 


B(s) 


a(t, x) 


x 


In the above figure, we see that the flow «, shoves the curve on the left 
to the curve on the right. We must compute the tangent vectors to the 
curve on the right. We have 


d 
7s (BIS) = D,a(t, B(s))B'(s) 
= D,a(t, B(s))n(B(s)). 


Now fix s, and denote this last expression by F(t). We must show that if 
G(t) = n(a(t, B(s))), 
F(t) = G(t). 


then 


We have trivially F(0) = G(0), in other words the curves F and G have 
the same initial condition. On the other hand, 


F'(t) = & (a(t, B(s)))D2a(t, B(s))n(B(s)) 


G'(t) = n' (alt, B(s))) Eat, B(S))) 
= €'(a(t, B(s)))n(a(t, B(s))) (because [¢, n] = 0). 


and 
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Hence we see that our two curves F and G satisfy the same differential 
equation, whence they are equal. This proves our theorem. 


Vector fields €, 4 such that [€,7] =0 are said to commute. One can 
generalize the process of straightening out vector fields to a finite number 
of commuting vector fields, using the same method of proof, using Theo- 
rem 1.5. As another application, one can prove that if the Lie algebra of 
a connected Lie group is commutative, then the group is commutative. 
Cf. the section on Lie groups. 


V, §2. LIE DERIVATIVE 


Let 2 be a differentiable functor on Banach spaces. For convenience, 
take 4 to be covariant and in one variable. What we shall say in the rest 
of this section would hold in the same way (with slightly more involved 
notation) if 4 had several variables and were covariant in some and 
contravariant in others. 

Given a manifold X, we can take A(T(X)). It is a vector bundle over 
X, which we denote by T,(X) as in Chapter III. Its sections [',(X) are 
the tensor fields of type 2. 

Let € be a vector field on X, and U open in X. It is then possible to 
associate with € a map 


T,(U) 91) 


(with a loss of two derivatives). This is done as follows. 

Given a point x of U and a local flow a for € at x, we have for each t 
sufficiently small a local isomorphism «, in a neighborhood of our point 
x. Recall that locally, «7 = «_,. If 4 is a tensor field of type 4, then the 
composite mapping 7 o a, has its range in 7,(X). Finally, we can take the 
tangent map T(a_,) = (a_,), to return to 7,(X) in the fiber above x. We 
thus obtain a composite map 


F(t, x) = (4 1)4 0 © (x) = (a) (x), 


which is a morphism, locally at x. We take its derivative with respect to 
t and evaluate it at 0. After looking at the situation locally in a trivial- 
ization of T(X) and T,(X) at x, one sees that the map one obtains gives 
a section of T,(U), that is a tensor field of type 2 over U. This is our 
map £,. To summarize, 


d 
Ln = at er (4) 5 ono m,. 
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This map %, is called the Lie derivative. We shall determine the Lie 
derivative on functions and on vector fields in terms of notions already 
discussed. 

First let @ be a function. Then by the general definition, the Lie 
derivative of this function with respect to the vector field € with flow « is 
defined to be 


ae | 
F(x) = lim -Lo(a(t, »)) — 0] 
t7- 
or in other words, 


d 
Lg = —(t9) 


t=0 


F(t) = e(a(t, x)). 


F'(t) = g' (a(t, x))D, a(t, x) 
= g' (a(t, x))E(a(t, x)), 


Our assertion is then that 


To prove this, let 


Then 


because a is a flow for &. Using the initial condition at t = 0, we find 
that 
F'(0) = o'(x)¢(x), 


which is precisely the value of €@ at x, thus proving our assertion. 


If &, n are vector fields, then 


As before, let « be a flow for €. The Lie derivative is given by 


d 
= —_(y* 
LN Fae: nl 
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Letting € and yn denote the local representations of the vector fields, we 
note that the local representation of (a*7)(x) is given by 


(an)(x) = F(t) = Do(—t, x)n(a(t, x). 


We must therefore compute F’(f), and then F’(0). Using the chain rule, 
the formula for the derivative of a product, and the differential equation 
satisfied by D,a, we obtain 


F(t) = —D, D,a(— t, x)n(a(t, x)) + D,a(— I, x)n' (a(t, x))D, a(t, x) 
= —€(a(—t, x))Dya(—t, x)n(a(t, x)) + Dza(—t, x)n'(a(t, x)). 


Putting t = 0 proves our formula, taking into account the initial conditions 


a(0, x) = x and D,«(0, x) = id. 


V, §3. EXTERIOR DERIVATIVE 


Let X be a manifold. The functor L’ (r-multilinear continuous alter- 
nating forms) extends to arbitrary vector bundles, and in particular, to 
the tangent bundle of X. A differential form of degree r, or simply an 
r-form on X, is a section of L7(T(X)), that is a tensor field of type L’. If 
X is of class C”, forms will be assumed to be of a suitable class C* with 
1<s<p-—1. The set of differential forms of degree r will be denoted 
by ./'(X) (& for alternating). It is not only a vector space over R but a 
module over the ring of functions on X (of the appropriate order of 
differentiability). If @ is an r-form, then c(x) is an element of L‘(T,(X)), 
and is thus an r-multilinear alternating form of T,(X) into R. We some- 
times denote w(x) by o,. 

Suppose U is open in the Banach space E. Then L’(T(U)) is equal to 
U x L‘(E) and a differential form is entirely described by the projection 
on the second factor, which we call its local representation, following our 
general system (Chapter III, §4). Such a local representation is therefore 
a morphism 


wo: U > L(E). 


Let w be in L{(E) and »,, ...,v, elements of E. We denote the value 
c(v,,.--,0,) also by 


{@, 0, X °° X U,>. 


Similarly, let €,, ...,€, be vector fields on an open set U, and let w be 
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an r-form on X. We denote by 


«Qo, a rr Ged 


the mapping from U into R whose value at a point x in U is 


(a(x), 61(%) x 1 x 6(x)). 


Looking at the situation locally on an open set U such that T(U) is 
trivial, we see at once that this mapping is a morphism (i.e. a function on 
U) of the same degree of differentiability as w and the €;. 


Proposition 3.1. Let x, be a point of X and w an r-form on X. If 


Ko, 1 X +" X C4) (Xo) 


is equal to 0 for all vector fields €,, ...,€, at Xo (i.e. defined on some 
neighborhood of X 9), then w(x) = 0. 


Proof. Considering things locally in terms of their local representa- 
tions, we see that if w(x )) is not 0, then it does not vanish at some 
r-tuple of vectors (v,,...,v,). We can take vector fields at x) which take 
on these values at x) and from this our assertion is obvious. 


It is convenient to agree that a differential form of degree 0 is a 
function. In the next proposition, we describe the exterior derivative of 
an r-form, and it is convenient to describe this situation separately in the 
case of functions. 

Therefore let f: X >R be a function. For each xe X, the tangent 
map 

Tf: T(X) > Tr.(R) = R 


is a continuous linear map, and looking at local representations shows at 
once that the collection of such maps defines a 1-form which will be 
denoted by df. Furthermore, from the definition of the operation of 
vector fields on functions, it is clear that df is the unique 1-form such 
that for every vector field € we have 


(af, o> = ¢f. 
To extend the definition of d to forms of higher degree, we recall that 
if 
@:U > LIE) 


is the local representation of an r-form over an open set U of E, then for 
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each x in U, 
w’ (x): E> Li(E) 


is a continuous linear map. Applied to a vector v in E, it therefore gives 
rise to an r-form on E. 


Proposition 3.2. Let w be an r-form of class C?~' on X. Then there 
exists a unique (r + 1)-form dw on X of class C?~* such that, for any 
open set U of X and vector fields €),...,€, on U we have 


Xda, Ey x ++: xX &,» 
= FM Eken by xo KA x ED 


+ ¥ (=U a, [Ei 61 x &o x 1 x Gx x Ey x x |). 
i<j 


If furthermore U is open in E and a, &o, ...,€, are the local represen- 
tations of the form and the vector fields respectively, then at a point x 
the value of the expression above is equal to 


Y, (We EiCe), Eola) x ++ x £160) G00). 


Proof. As before, we observe that the local formula defines a differ- 
ential form. If we can prove that it gives the same thing as the first 
formula, which is expressed invariantly, then we can globalize it, and we 
are done. Let us denote by S, and S, the two sums occurring in the 
invariant expression, and let L be the local expression. We must show 
that S,; + S, = L. We consider S,, and apply the definition of €; oper- 
ating on a function locally, as in Proposition 1.1, at a point x. We 
obtain 


=F (MK 0, bo x x 8 xo x EYED 


The derivative is perhaps best computed by going back to the definition. 
Applying this definition directly, and discarding second order terms, we 
find that S, is equal to 
$ ? “N 
D (— Vio" (X)E(%), Sol) x ++ x E(x) x ++ x E00) 
3 “N 
+ EE (0 Lob), Gol) +++ GOVE) x 1+ EG oo x El) 


+ YY KH), Eo) x 1+ x GO) x +1 x EEO) x 0 x G(X). 


t j<i 
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Of these three sums, the first one is the local formula L. As for the 
other two, permuting j and i in the first, and moving the term ¢;(x)é,(x) 
to the first position, we see that they combine to give (symbolically) 


=D L(V Mo, (GS; — 618) x Go x x x KG KK ED 
i i<j 
(evaluated at x). Using Proposition 1.3, we see that this combination is 
equal to —S,. This proves that S, + S, = L, as desired. 


We call dw the exterior derivative of w. Leaving out the order of 
differentiability for simplicity, we see that d is an R-linear map 


d: f"(X) > f"*"(X). 


We now look into the multiplicative properties of d with respect to the 
wedge product. 

Let w, w be continuous multilinear alternating forms of degree r and s 
respectively on the Banach space E. In multilinear algebra, one defines 
their wedge product as an (r + s)-continuous multilinear alternating form, 
by the formula 


1 
(o A Wr, Snare »Urts) = ris! > E(a)a(ve, es Vor) Vor +1)> cee Ver+s)) 


the sum being taken over all permutations o of (1,...,r +s). This defini- 
tion extends at once to differential forms on a manifold, if we view it as 
giving the value for w A w at a point x. The v; are then elements of the 
tangent space T,, and considering the local representations shows at once 
that the wedge product so defined gives a morphism of the manifold X 
into L’*s(T(X)), and is therefore a differential form. 


Remark. The coefficient 1/r!s! is not universally taken to define the 
wedge product. Some people, e.g. [He 78] and [KoN 63], take 1/(r + s)!, 
which causes constants to appear later. I have taken the same factor as 
[AbM 78] and [GHL 87/93]. I recommend that the reader check out 
the case with r=s=1 so r+s=2 to see how a factor 4 comes in. 
With either convention, the wedge product between forms is associative, 
so with some care, one can carry out a consistent theory with either 
convention. I leave the proof of associativity to the reader. It follows by 
induction that if w,, ...,@,, are forms of degrees r,, ...,r,, respectively, 
and t=r,+-°::+7,,, then 


(@, Av N ym) (Vy 5 ++ 50,) 
1 
Be erray (91; seo Ver, )M2 (Vor, +1)orrr> Vo(r,+r>)) ree Drm (Vo(r—rg +1) ogin'9: Ver) 
m: oO 


r,!- 


where the sum is taken over all permutations of (1,...,r). 
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If we regard functions on X as differential forms of degree 0, then the 
ordinary product of a function by a differential form can be viewed as 
the wedge product. Thus if f is a function and a differential form, 
then 

fo=fao. 


(The form on the left has the value f(x)@(x) at x.) 
The next proposition gives us more formulas concerning differential 
forms. 


Proposition 3.3. Let @, W be differential forms on X. Then 
EXD 1. d(w a W) = dan w +(—1)88@ vn dp. 
EXD 2. ddw = 0 (with enough differentiability, say p = 4). 


Proof. This is a simple formal exercise in the use of the local formula 
for the local representation of the exterior derivative. We leave it to the 
reader. 

When the manifold is finite dimensional, then one can give a local 
representation for differential forms and the exterior derivative in terms 
of local coordinates, which are especially useful in integration which fits 
the notation better. We shall therefore carry out this local formulation in 
full. It dates back to Cartan [Ca 28]. There is in addition a theoretical 
point which needs clarifying. We shall use at first the wedge /\ in two 
senses. One sense is defined as above, giving rise to Proposition 3.3. 
Another sense will come from Theorem A. We shall comment on their 
relation after Theorem B. 

We recall first two simple results from linear (or rather multilinear) 
algebra. We use the notation E” = E x E x -:- x E, r times. 


Theorem A. Let E be a finite dimensional vector space over the reals of 
dimension n. For each positive integer r with 1 <r <n there exists a 
vector space /\’ E and a multilinear alternating map 


EO +N E 


denoted by (u,,...,U,) > Uy A+: AUu,, having the following property: 
If {v,,...,0,} is a basis of E, then the elements 


{VeRO ND Sig sr Shy 
form a basis of VE. 


We recall that alternating means that u, A--- A u,=0 if u,; =u, for 
some i #j. We call /’ E the r-th alternating product (or exterior prod- 
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uct) on E. If r= 0, we define \° E=R. Elements of /’ E which can be 
written in the form u, A -*: A u, are called decomposable. Such elements 
generate /\'E. If r > dim E, we define /V E = {0}. 


Theorem B. For each pair of positive integers (r,s), there exists a 
unique product (bilinear map) 


NEx NEONYE 
such that if uy, ...,U,, Wy, ---,W, € E then 
(u, Act 1 A U,) X (Wy Act A Wy) Uy A AU, A Wy Att A Wy. 
This product is associative. 


The proofs for these two statements can be found, for instance, in my 
Linear Algebra. 


Let EY be the dual space, EY = L(E,R). If E=R" and 4,, ...,A, are 
the coordinate functions, then each A; is an element of the dual space, 
and in fact {A,,...,4,} is a basis of this dual space. Let E=R". There 
is an isomorphism 


NV EY > LY(E, R) 


given in the following manner. If g,, ....g,€ EY and v,, ...,v,€E, then 


the value 
det(g (v;)) 


is multilinear alternating both as a function of (g,,...,g,) and (v,,...,0,). 
Thus it induces a pairing 


NEY x E'oR 
and a map 
IN EY > LUE, R). 


This map is the isomorphism mentioned above. Using bases, it is easy to 
verify that it is an isomorphism (at the level of elementary algebra). 

Thus in the finite dimensional case, we may identify L7(E, R) with the 
alternating product /’ EY, and consequently we may view the local rep- 
resentation of a differential form of degree r to be a map 


@:U > EY 
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from U into the rth alternating product of EY. We say that the form is 
of class C? if the map is of class C?. (We view A’ EY as a normed 
vector space, using any norm. It does not matter which, since all norms 
on a finite dimensional vector space are equivalent.) The wedge product 
as we gave it, valid in the infinite dimensional case, is compatible with 
the wedge product and the isomorphism of A’ E with Li(E,R) given 
above. If we had taken a different convention for the wedge product of 
alternating forms, then a constant would have appeared in front of the 
above determinant to establish the above identification (e.g. the constant 
4in the 2 x 2 case). 
Since {2,,...,4,} is a basis of EY, we can express each differential 
form in terms of its coordinate functions with respect to the basis 
{Ai Ac AALS; (i, <*+' <i,), 


namely for each x e U we have 
O(x) =P fii QA, AoA ALS 
@ 


where fi = f;,...;, is a function on U. Each such function has the same 
order of differentiability as w. We call the preceding expression the 
standard form of w. We say that a form is decomposable if it can be 
written as just one term f(x)A;, A -:: A 4,,. Every differential form is a 
sum of decomposable ones. 

We agree to the convention that functions are differential forms of 
degree 0. 

As before, the differential forms on U of given degree r form a vector 
space, denoted by /(U). 

Let E=R". Let f be a function on U. For each xe U the derivative 


f(x): RK" R 
is a linear map, and thus an element of the dual space. Thus 
f:. U7>EY 
represents a differential form of degree 1, which is usually denoted by df. 
If f is of class C”, then df is class C?™'. 
Let 1; be the i-th coordinate function. Then we know that 


dA;(x) = A(x) = A; 


for each x € U because J’(x) = A for any continuous linear map 4. When- 
ever {x,,...,x,} are used systematically for the coordinates of a point 
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in R", it is customary in the literature to use the notation 
di,(x) = dx;. 
This is slightly incorrect, but is useful in formal computations. We shall 


also use it in this book on occasions. Similarly, we also write (incor- 
rectly) 


o=Y fi dx; Ao A dx;, 
@ 
instead of the correct 
@(x) = > S)Ar, Att AA. 


In terms of coordinates, the map df (or f’) is given by 


Af (x) = f'(x) = Dy fay + + Dif O)An» 


where D, f(x) = 6f/dx; is the i-th partial derivative. This is simply a 
restatement of the fact that if h = (h,,...,h,) is a vector, then 


IN iets a TA a 
f (x) = ax + + ax, tm" 
Thus in old notation, we have 
df(x) = ae apo oe of dx,,. 
Ox, OX, 


We shall develop the theory of the alternating product and the exte- 
rior derivative directly without assuming Propositions 3.2 or 3.3 in the 
finite dimensional case. 

Let w and w be forms of degrees r and s respectively, on the open set 
U. For each x € U we can then take the alternating product w(x) A w(x) 
and we define the alternating product @ A wy by 


(@ A W)(x) = @(x) A WO). 


(It is an exercise to verify that this product corresponds to the product 
defined previously before Proposition 3.3 under the isomorphism between 
L'(E, R) and the r-th alternating product in the finite dimensional case.) 
If f is a differential form of degree 0, that is a function, then we have 
again 


fro= fo, 
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where (fw) (x) = f(x)@(x). By definition, we then have 


on fl =fonry. 


We shall now define the exterior derivative dw for any differential form 
@. We have already done it for functions. We shall do it in general first 
in terms of coordinates, and then show that there is a characterization 
independent of these coordinates. If 


o= Shs di;, Act N di, , 
(i) 


we define 
do = ¥ df 0 dai, A++: A da,,. 
a) 


Example. Suppose n = 2 and q is a 1-form, given in terms of the two 
coordinates (x, y) by 


w(x, y) = f(x, y) dx + g(x, y) dy. 
Then 


da(x, y) = df(x, y) A dx + dg(x, y) A dy 


_ (oF of og ag 

= (Yax+ Lay) dx + (Max May) ads 
_ of og 

reg ee aay ee 


because the terms involving dx A dx and dy A dy are equal to 0. 
Proposition 3.4. The map d is linear, and satisfies 
dia a W=do rnw4+(—-Ilo a dp 


if r= deg qa. The map d is uniquely determined by these properties, and 
by the fact that for a function f, we have df = f’. 


Proof. The linearity of d is obvious. Hence it suffices to prove the 
formula for decomposable forms. We note that for any function f we 
have 

d(fo) =df na+ f do. 


Indeed, if w is a function g, then from the derivative of a product we get 
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d(fg) = f dg + g df. If 
w=gdi; n***a di,, 
where g is a function, then 


d( fa) = d(fg dd;, ++: A dd;) = d(fg) a di; N-** A dd; 
=(fdg+gdf)adi;,, n°: A di; 
=fdo+dfnaa, 


as desired. Now suppose that 


o=fds, n-- Adi, and w=gdi, n-- A dij, 
= fo, = gi, 


with i, <-+- <i, and j; <-:: <j, as usual. If some i, =j,, then from the 
definitions we see that the expressions on both sides of the equality 
in the theorem are equal to 0. Hence we may assume that the sets 
of indices i;, ...,i, and j,, ...,j, have no element in common. Then 
d(é a W) = 0 by definition, and 


don p= fod np) = df Gay 
=(gdf + fdghrn any 
=dornwt+fdgnaaw 
=donw+(-lfarndgaw 
=donrw+(—Dl'on dy, 


thus proving the desired formula, in the present case. (We used the fact 
that dg 0 6 =(—1)'@ A dg whose proof is left to the reader.) The for- 
mula in the general case follows because any differential form can be 
expressed as a sum of forms of the type just considered, and one can 
then use the bilinearity of the product. Finally, d is uniquely determined 
by the formula, and its effect on functions, because any differential form 
is a sum of forms of type f dd; A-:- A dd; and the formula gives an 
expression of d in terms of its effect on forms of lower degree. By 
induction, if the value of d on functions is known, its value can then be 
determined on forms of degree = 1. This proves our assertion. 


Proposition 3.5. Let w be a form of class C?. Then ddw = 0. 
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Proof. If f is a function, then 


and 


Using the fact that the partials commute, and the fact that for any two 
positive integers r, s we have dx, A dx, = —dx, A dx,, we see that the 
preceding double sum is equal to 0. A similar argument shows that the 
theorem is true for 1-forms, of type g(x) dx; where g is a function, and 
thus for all 1-forms by linearity. We proceed by induction. It suffices to 
prove the formula in general for decomposable forms. Let w be decom- 
posable of degree r, and write 


o=nnry, 


where deg y = 1. Using the formula for the derivative of an alternating 
product twice, and the fact that dd =0 and ddy =0 by induction, we 
see at once that ddw = 0, as was to be shown. 

We conclude this section by giving some properties of the pull-back of 
forms. As we saw at the end of Chapter III, §4, if f: X > Y is a mor- 
phism and if w is a differential form on Y, then we get a differential form 
f*(@) on X, which is given at a point x e X by the formula 


f*(@); = D(x) 2. (Tf), 


if w is of degree r. This holds for r= 1. The corresponding local repre- 
sentation formula reads 


<f *e0(x), E1(x) X + x EO) = CO( S09), FEL) * + x FED) 
if €,, ...,€, are vector fields. 


In the case of a 0-form, that is a function, its pull-back is simply the 
composite function. In other words, if @ is a function on Y, viewed as a 


form of degree 0, then 
f*@=0f. 


It is clear that the pull-back is linear, and satisfies the following 
properties. 
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Property 1. If w, w are two differential forms on Y, then 
f*(o rn W) = f*@) a fF). 
Property 2. If @ is a differential form on Y, then 
df *(w) = f* (da). 


Property 3. If f: X > Y and g: YZ are two morphisms, and @ is a 
differential form on Z, then 


I*(g*(@)) = (go f*(@). 


Finally, in the case of forms of degree 0: 
Property 4. If f: X — Y is a morphism, and g is a function on Y, then 


d(g o f) = f*(dg) 


and at a point x € X, the value of this 1-form is given by 


Then 9 Tf = (dg) 2 Tf. 
The verifications are all easy, and even trivial, except possibly for 
Property 2. We shall give the proof of Property 2 in the finite dimen- 


sional case and leave the general case to the reader. 
For a form of degree 1, say 


oy) = g(y) dy, 


with y, = f,(x), we find 


(f* doo)(x) = (9'(F0)) © °C) 0 df, (0. 


Using the fact that ddf, = 0, together with Proposition 6 we get 


(df * co) (x) = (d(g 0 f))(x) 0 afi), 


which is equal to the preceding expression. Any 1-form can be expressed 
as a linear combination of form g; dy;, so that our assertion is proved for 
forms of degree 1. 

The general formula can now be proved by induction. Using the 
linearity of f*, we may assume that w is expressed as w = W A n where 
Ww, 1 have lower degree. We apply Proposition 3.3 and Property 1 to 


f*do = f*(dp an) +(—WF*W A dn) 
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and we see at once that this is equal to df*w, because by induction, 
f*dy = df*y and f*dyn = df*n. This proves Property 2. 


Example 1. Let y,, ...,¥, be the coordinates on V, and let py; be the 
jth coordinate function, j = 1, ...,m, so that y; = uj(y1,.--,Ym). Let 


f:UnV 
be the map with coordinate functions 


: yj, = FX) = yo f(*). 
I 
oy) = g(y) dyj, A+++ A dy;, 


is a differential form on V, then 


f*o = (90 fl dfi, N° 0 afi, 


Indeed, we have for x € U: 


(f*e)(x) = g(F0)) (Hy, 0 £00) Ao A (uy, 0 £09) 
Ff) = (uj 0 FY CD) = Wye F'(x) = Af,0). 


and 


Example 2. Let f:[a,b]—R? be a map from an interval into the 
plane, and let x, y be the coordinates of the plane. Let t be the coordi- 
nate in [a,b]. A differential form in the plane can be written in the form 


(x, y) = g(x, y) dx + h(x, y) dy, 


where g, h are functions. Then by definition, 


i d 
felt) = glx), yO) dt + M(x(0), yl) dt, 


if we write f(t) = (x(t), y(t). Let G=(g,h) be the vector field whose 
components are g and h. Then we can write 


S*o(t) = G(f(O)- f'O at, 


which is essentially the expression which is integrated when defining the 
integral of a vector field along a curve. 
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Example 3. Let U, V be both open sets in n-space, and let f: U > V 
be a C? map. If 


oy) = gly) dy, A+++ A dyn; 
where y,; = fj(x) is the j-th coordinate of y, then 


dy; = D, f(x) dx, + ++: + D, f(x) ax, 


oy; 
d 
OX, Xn 


= 3 fot 


and consequently, expanding out the alternating product according to the 
usual multilinear and alternating rules, we find that 


S*o(x) = GSC) Ap) dx, Ao A dx, 


where A, is the determinant of the Jacobian matrix of f. 


V, §4. THE POINCARE LEMMA 
If w is a differential form on a manifold and is such that dw = 0, then it 
is customary to say that w is closed. If there exists a form w such that 
@ = dy, then one says that w is exact. We shall now prove that locally, 
every closed form is exact. 
Theorem 4.1 (Poincaré Lemma). Let U be an open ball in E and let w 
be a differential form of degree 21 on U such that do =0. Then 
there exists a differential form W on U such that dy = a. 


Proof. We shall construct a linear map k from the r-forms to the 
(r — 1)-forms (r = 1) such that 


dk + kd = id. 
From this relation, it will follow that whenever dw = 0, then 
dkow = a, 


thereby proving our proposition. We may assume that the center of the 
ball is the origin. If w is an r-form, then we define kw by the formula 


1 
<(ka@),, 0, X °°" X U,-4> | t’ 1<a(tx), x x vy X** X v,_-,> dt. 
0 
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We can assume that we deal with local representations and that vu; € E. 
We have 


<(dko),,0, X ++" X v,> 


SS ai 
i=1 


1 
y(-vit | t’*<@(tx), 0; X vy X*+ xX 6; x +++ x v,> dt 
0 


1 
+ ¥ (-1)7! | ta’ (tx)v;, x X vy X00 X OX + x v,> dt. 
: 0 


On the other hand, we also have 
<(kdov)(x), vy x ++ x v,> 


1 
a} t"(da(x), x x vy X +++ x v,> dt 


1 
-| ta’ (tx)x, vy X +++ X v,> dt 


i) 
; 1 
+ Ein | t"<o'(tx)u;, x X vy X ++ x 6; xX +++ X v,> dt. 
0 
We observe that the second terms in the expressions for kdw and 
dkw occur with opposite signs and cancel when we take the sum. As to 


the first terms, if we shift v; to the i-th place in the expression for dko, 
then we get an extra coefficient of (—1)'*!. Thus 


1 


dkw + kdw = | rt’-*@(tx), vy X ++ xX v,> dt 
0 


1 
+| ta’ (tx)x, vy X ++ X 0,» dt. 


0 


This last integral is simply the integral of the derivative with respect 
to t of 


<t’@(tx), vy X +++ xX U,>. 
Evaluating this expression between t = 0 and t = 1 yields 
(x), 0 X *** X B,) 


which proves the theorem. 


[V, §5] CONTRACTIONS AND LIE DERIVATIVE 137 


We observe that we could have taken our open set U to be star- 
shaped instead of an open ball. For more information on the relation- 
ship between closed and exact forms, see Chapter XIII, §1. 


V, §5. CONTRACTIONS AND LIE DERIVATIVE 


Let ¢ be a vector field and let w be an r-form on a manifold X, r= 1. 
Then we can define an (r — 1)-form C,@ by the formula 


(C.@)(x)(v2 gore »0,) = o(E(x), v2, “ss 52,), 


for v2, ...,v,€ T,. Using local representations shows at once that C,w 
has the appropriate order of differentiability (the minimum of @ and ©). 
We call C,@ the contraction of @ by ¢, and also denote C,@ by 


@moé, 


If f is a function, we define C,f = 0. Leaving out the order of differ- 
entiability, we see that contraction gives an R-linear map 


C: of"(X) > oh" (X). 

This operation of contraction satisfies the following properties. 

CON 1. Cz0C;=0. 

CON 2. The association (&, @) + C,@ = @0 € is bilinear. It is in fact 
bilinear with respect to functions, that is if @ is a function, 
then 

Coz = GC; and Cpa) = eCzo. 

CON 3. If a, W are differential forms and r = deg a, then 

Co a p) = (C.0) AW +(-lo a Czy. 
These three properties follow at once from the definitions. 
Example. Let X = R", and let 


(x) = dx, A+*: A dX,. 


If € is a vector field on R", then we have the local representation 


(@ o €)(x) = y (—1) *1E,(x) dx, Act A dx, Att A AXpq: 
i=1 
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We also have immediately from the definition of the exterior derivative, 


Hoot) = & 9 


i=1 Ox; 


dx, Av** A AX, 


letting € = (€,,...,€,) in terms of its components €;. 


We can define the Lie derivative of an r-form as we did before for 
vector fields. Namely, we shall evaluate the following limit: 


1 
(La) (x) = lite Ps [ (ae) (x) — w(x)], 
or in other words, 


d 
L.0 = qe) 


t=0 


where « is the flow of the vector field €, and we call £, the Lie derivative 
again, applied to the differential form w. We may rewrite this definition 
in terms of the value on vector fields €,, ...,¢, as follows: 


d 
(F.m)(¢,, reg Ge) = ne OO, OnE y xX Oa,» 


t=0 
Proposition 5.1. Let € be a vector field and w a differential form of 


degreer 21. The Lie derivative Y, is a derivation, in the sense that 


LOE sy 18) = (LONE rs) + Y OF so Phir) 


where of course %.€; = ([€, €;]. 

If €, €:, w denote the local representations of the vector fields 
and the form respectively, then the Lie derivative L,@ has the local 
representation 


((-Ze@)(x), 61() x +++ x 0) 
= C@"(x)E(x), 61 (%) x 7+ x E(x) 


+ y (x), Ey(x) x + x E(x)E{(x) x ++ x E(x). 


Proof. The proof is routine using the definitions. The first assertion is 
obvious by the definition of the pull back of a form. For the local 
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expression we actually derive more, namely we derive a local expression 
for a*w@ and nee which are characterized by their values at (€,,..., €,). 
So we let 
(1) F(t) = <a) (x), €1() x +++ x E00) 
= <o(a(t, x)), Dy, a(t, x)E4 (x) x" Xx D, a(t, x)E,(x)>. 

Then the Lie derivative (Y.«)(x) is precisely F’(0), but we obtain also 

the local representation for ne: 
. d 

(2) F@= € ogfon(x), 1(x) x +++ x 0) = 
3) Ceo (a(t, x) Dy a(t, x), Dpa(t, x)Eq(x) x + x Dealt, x)G,(x)) 


+ 3 <co(a(t, x)), Daa(t, x)E,(x) x +++ x Dy Dya(t, x)E(x) x +++ x Dy alt, x)E,(x)> 


by the rule for the derivative of a product. Putting t=0 and using the 
differential equation satisfied by D,a(t, x), we get precisely the local ex- 
pression as stated in the proposition. Remember the initial condition 
D,«(0, x) = id. 


From Proposition 5.1, we conclude that the Lie derivative gives an 
R-linear map 


L,: A" (X) > A" (X). 


We may use expressions (1) and (3) in the above proof to derive a 
formula which holds even more generally for time-dependent vector fields. 


Proposition 5.2. Let €, be a time-dependent vector field, a its flow, and 
let w be a differential form. Then 


d d 
ne) = at( Lo) or qe) = at (La) 


for a time-independent vector field. 
Proof. Proposition 5.1 gives us a local expression for (%,,«)(y), re- 


placing x by y because we shall now put y = a(t, x). On the other hand, 
from (1) in the proof of Proposition 5.1, we obtain. 


otf (L,0)(x) = ((-Lz,0)(y), Dat, x)E1(x) x ++ x Daae(t, x)E-(x)>. 
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Substituting the local expression for (¥%,@)(y), we get expression (3) from 
the proof of Proposition 5.1, thereby proving Proposition 5.2. 


Proposition 5.3. As a map on differential forms, the Lie derivative 
satisfies the following properties. 


LIE 1. ZY =doCe+ Czod, so Y= Czod on functions. 

LIE 2. Zion p)=Laanpt+oan Ly. 

LIE 3. Y. commutes with d and C;. 

LIE 4. Ye = L0G, -— G0 %:. 

LIE 5. Ce = Leo C, — Cyo &. 

LIE 6. Y,.0 = fL.0 + df Co for all forms w and functions f. 


Proof. Let €,, ...,é, be vector fields, and wm an r-form. Using the 
definition of the contraction and the local formula of Proposition 5.1, we 
find that C,dw is given locally by 


(Czdeo(x), $1 (x) x +** x &,(x)> 
= Ca'(x) E(x), E1(%) x ++ x €-(x)> 


+ 3 (“10 E08), EO) Ex) EG) 7 GO). 
On the other hand, dC, is given by 
(dC 001%), Ex(8) x -** X EC) 
= YH 1 2(C yey EC), El) EG) x > >. 


To compute (C,@)'(x) is easy, going back to the definition of the deriva- 
tive. At vectors v,, ...,v,-,, the form C,a(x) has the value 


<o(x), (x) x U4 xo xX Ov, -1 ». 


Differentiating this last expression with respect to x and evaluating at a 
vector h we get 


<ar'(x)h, E(x) x vy X11 X Bd + CO(X), SOA X Vy X11 X Opa? 


Hence <dCza(x), €,(x) x «+ x €,(x)> is equal to 


¥ (HDF oo, E00) x Ey(0) x x EG) x x G59) 


r . oe 
+ > (—1)'**Ceo(x), EX E(x) x Ex) x 1 & E(x) Ko K E(x). 
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Shifting &’(x)é(x) to the i-th place in the second sum contributes a sign 
of (—1)'! which gives 1 when multiplied by (—1)'*!. Adding the two 
local representations for dC,;@ and C, dq, we find precisely the expres- 
sion of Proposition 5.1, thus proving LIE 1. 

As for LIE 2, it consists in using the derivation rule for d and C, in 
Proposition 3.3, EXD 1, and CON 3. The corresponding rule for &, 
follows at once. (Terms will cancel just the right way.) The other prop- 
erties are then clear. 


V, §6. VECTOR FIELDS AND 1-FORMS 
UNDER SELF DUALITY 


Let E be a Banach space and let 
(v, w) > Xv, w> 


be a continuous bilinear function of E x ER. We call such a function 
a bilinear form. This form induced a linear map 


A: E> EY 
which to each v € E associates the functional 4, such that 
A,(w) = Cv, w). 


We have a similar map on the other side. If both these mappings are 
toplinear isomorphisms of E and EY then we say that the bilinear form 
is non-singular. If such a non-singular form exists, then we say that E is 
self-dual. For instance, a Hilbert space is self-dual. 

If E is finite dimensional, it suffices for a bilinear form to be non- 
singular that its kernels on the right and on the left be 0. (The kernels 
are the kernels of the associated maps 4 as above.) However, in the 
infinite dimensional case, this condition on the kernels is not sufficient 
any more. 

Let E be a self dual Banach space with respect to the non-singular 
form (v, w)  <v, w>, and let 


Q:ExE>R 
be a continuous bilinear map. There exists a unique operator A such 
that 
Q(v, w) = ¢Ad, w> 


for all v, we E. (An operator is a continuous linear map by definition.) 


142 OPERATIONS ON VECTOR FIELDS [V, §6] 


Remarks. Suppose that the form (v, w) ++ (v, w> is symmetric, i.e. 
<v, Ww) = Cw, v> 


for all v, we E. Then Q is symmetric (resp. alternating) if and only if A is 
symmetric (resp. skew-symmetric). Recall that A symmetric (with respect 
to < , >) means that 


<Av, w> = <v, Aw» for all v,wekE. 


That A is skew-symmetric means that <Av, w) = —<Aw,v)> for all », 
weE. For any operator A:E-E there is another operator ‘A (the 
transpose of A with respect to the non-singular form < , )) such that for 
all v, we E we have 


<Av, w> = <0, ‘Aw). 


Thus A is symmetric (resp. skew-symmetric) if and only if ‘A = A (resp. 
‘A = —A). 


The above remarks apply to any continuous bilinear form Q. For 
invertibility, we have the criterion: 


The form Q is non-singular if and only if the operator A representing 
the form with respect to < , > is invertible. 


The easy verification is left to the reader. Of course, in the finite dimen- 
sional case, invertibility or non-singularity can be checked by verifying 
that the matrix representing the linear map with respect to bases has 
non-zero determinant. Similarly, the form is also represented by a matrix 
with respect to a choice of bases, and its being non-singular is equivalent 
to the matrix representing the form being invertible. 

We recall that the set of invertible operators in Laut(E) is an open 
subset. Alternatively, the set of non-singular bilinear forms on E is an 
open subset of L?(E). 

We may now globalize these notions to a vector bundle (and eventu- 
ally especially to the tangent bundle) as follows. 

Let X be a manifold, and x: E— X a vector bundle over X with fibers 
which are toplinearly isomorphic to E, or as we shall also say, modeled 
on E. Let Q be a tensor field of type L? on E, that is to say, a section of 
the bundle L?(E) (or L?(z)), or as we shall also say, a bilinear tensor field 
on E. Then for each x eX, we have a continuous bilinear form Q, 
on E,. 

If Q, is non-singular for each x e X then we say that Q is non-singular. 
If x is trivial, and we have a trivialisation X x E, then the local represen- 
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tation of Q can be described by a morphism of X into the Banach space 
of operators. If Q is non-singular, then the image of this morphism is 
contained in the open set of invertible operators. (If Q is a 2-form, this 
image is contained in the submanifold of skew-symmetric operators.) For 
example, in a chart U, we can represent Q over U by a morphism 


A: U > L(E, E) such that Q,(v, w) = (A,.0, w> 


for all v, we E. Here we wrote A, instead of A(x) to simplify the 
typography. 

A non-singular Q as above can be used to establish a linear 
isomorphism 


T(E) > TL'(E), also denoted by IL(E) or TE’, 


between the R-vector spaces of sections I'(E) of E and the 1-forms on E 
in the following manner. Let & be a section of E. For each xe X we 
define a continuous linear map 


(Qo ¢),: FE, +R 
by the formula 
(Q.0 £),(w) = Q,(E (x), ). 


Looking at local trivialisations of 2, we see at once that No € is a 1-form 
on E. 

Conversely, let @ be a given 1-form on E. For each xe X, @, is 
therefore a 1-form on E, and since Q is non-singular, there exists a 
unique element &(x) of E, such that 


0,,(€(x), @) = @,(w) 
for all we E,. In this fashion, we obtain a mapping € of X into E and 
we contend that é is a morphism (and therefore a section). 
To prove our contention we can look at the local representations. We 


use © and @ to denote these. They are represented over a suitable open 
set U by two morphisms 


A: U = Aut(E) and n:U>E 
such that 


Q,.(v, w) = <A,v, w> and —@,(w) = C(x), w). 
From this we see that 


E(x) = Ay'n(), 
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from which it is clear that € is a morphism. We may summarize our 
discussion as follows. 


Proposition 6.1. Let X be a manifold and xn: E—X a vector bundle 
over X modeled on EK. Let Q be a non-singular bilinear tensor field on 
E. Then Q induces an isomorphism of &(X)-modules 


TE>TEY. 
A section € corresponds to a 1-form w if and only if Q0€=a. 


In many applications, one takes the differential form to be df for some 
function f. The vector field corresponding to df is then called the gradi- 
ent of f with respect to Q. 


Remark. There is no universally accepted notation to denote the cor- 
respondence between a 1-form and a vector field under Q as above. 
Some authors use sharps and flats, which have two disadvantages. First, 
they do not provide a symbols for the mapping, and second they do not 
contain the Q in the notation. I would propose the check sign Vg to 
denote either isomorphism 


Ve: TL(E) 3 TE denoted on elements by or Vea=o" =€, 
and also 
Ve: TE>TL(E) denoted on elements by Er Vob = €Y =a. 


If Q is fixed throughout a discussion and need not be referred to, then it 
is useful to write €Y or AY in some formulas. We have V/go Vo = id. 
Instead of the sharp and flat superscript, I prefer the single ’ sign. 

Many important applications of the above duality occur when Q is a 
non-singular symmetric bilinear tensor field on the tangent bundle TX. 
Such a tensor field is then usually denoted by g. If &, 4 are vector fields, 
we may then define their scalar product to be the function 


Co, >, = GCE, N). 


On the other hand, by the duality of Proposition 6.1, if ie. @, A are 
1-forms, i.e. sections of the dual bundle T’ X, then wY and 4” are vector 
fields, and we define the scalar product of the 1-forms to be 


{@, A>, = (@, AY). 


This duality is especially important for Riemannian metrics, as in Chap- 
ter X. 
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The rest of this section will not be used in the book. 

In Proposition 6.1, we dealt with a quite general non-singular bilinear 
tensor field on E. We now specialize to the case when E = TX is the 
tangent bundle of X, and Q is a 2-form, ie. Q is alternating. A pair 
(X, Q) consisting of a manifold and a non-singular closed 2-form is called 
a symplectic manifold. (Recall that closed means dQ = 0.) 

We denote by €, 7 vector fields over X, and by f, h functions on X, so 
that df, dh are 1-forms. We let &,, be the vector field on X which 
corresponds to df under the 2-forms Q, according to Proposition 6.1. 
Vector fields on X which are of type &, are called Hamiltonian (with 
respect to the 2-form). More generally, we denote by €,, the vector field 
corresponding to a 1-form w. By definition we have the formula 


Q 1. Qoé, =a so in particular Qo la, = af. 


In Chapter VII, §6 we shall consider a particularly important example, 
when the base manifold is the cotangent bundle; the function is the 
kinetic energy 


K(v) = 340, v), 


with respect to the scalar product g of a Riemannian or pseudo Rieman- 
nian metric, and the 2-form © arises canonically from the pseudo Rie- 
mannian metric. 

In general, by LIE 1 of Proposition 5.3 formula Q 1, and the fact that 
dQ = 0, we find for any 1-form w that: 


22. £0 = do. 


The next proposition reinterprets this formula in terms of the flow when 
dw = 0. 


Proposition 6.2. Let w be such that dw =0. Let a be the flow of ¢q. 
Then a*Q =Q for all t (in the domain of the flow). 


Proof. By Proposition 5.2, 


d 
qe = afH,QA=0 by 22. 


Hence «*Q is constant, equal to a§Q = Q, as was to be shown. 

A special case of Proposition 6.2 in Hamiltonian mechanics is when 
w = dh for some function h. Next by LIE 5, we obtain for any vector 
fields &, n: 


LQ) = (LQ) on + Qe [E, 1]. 
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In particular, since ddf = 0, we get 
Q 3. £2, (Q © Can) = Qo [oar> Cand. 


One defines the Poisson bracket between two functions f, h to be 


{fh} = Say h. 


Then the preceding formula may be rewritten in the form 


QD 4. [Sars San] = Cats.h}- 


It follows immediately from the definitions and the antisymmetry of the 
ordinary bracket between vector fields that the Poisson bracket is also 
antisymmetric, namely 


{fh} = —{h, fh. 


In particular, we find that 
Car Sf =0. 


In the case of the cotangent bundle with a symplectic 2-form as in the 
next section, physicists think of f as an energy function, and interpret 
this formula as a law of conservation of energy. The formula expresses 
the property that f is constant on the integral curves of the vector field 
Cas: This property follows at once from the definition of the Lie deriva- 
tive of a function. Furthermore: 


Proposition 6.3. If €4,:h = 0 then €4,°f =0. 


This is immediate from the antisymmetry of the Poisson bracket. It is 
interpreted as conservation of momentum in the physical theory of Ham- 
iltonian mechanics, when one deals with the canonical 2-form on the 
cotangent bundle, to be defined in the next section. 


V, §7. THE CANONICAL 2-FORM 


Consider the functor E++ L(E) (continuous linear forms). If E> X is a 
vector bundle, then L(E) will be called the dual bundle, and will be 
denoted by EY. For each xe X, the fiber of the dual bundle is simply 
L(E,,). 

If E = T(X) is the tangent bundle, then its dual is denoted by TY (X) 
and is called the cotangent bundle. Its elements are called cotangent 
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vectors. The fiber of T’(X) over a point x of X is denoted by T,’(X). 
For each x e X we have a pairing 

T,” x T,>7R 
given by 

<A, ud = Au) 


for Ae T,’ and ve f, (it is the value of the linear form 4 at u). 

We shall now describe how to construct a canonical 1-form on the 
cotangent bundle T’(X). For each Ae T’(X) we must define a 1-form 
on T,(T ’(X)). 

Let x: TY (X)— X be the canonical projection. Then the induced tan- 
gent map 

Tx = 1,: T(T ‘(X)) > T(X) 


can be applied to an element z of T,(T ’(X)) and one sees at once that 
n,Z lies in T,(X) if A lies in T.’(X). Thus we can take the pairing 


<A, TyZ> = 6,(z) 
to define a map (which is obviously continuous linear): 
6,: T,(T ¥(X)) oR. 


Proposition 7.1. This map defines a 1-form on T’(X). Let X =U be 
open in E and 


TY(U)=UxEY,  T(TY(U)) =(U x EY) x (Ex E’). 


If (x,A)eEU x EY and (u,w)eE x E’, then the local representation 
x,» iS given by 
(Ox, 4), (u, w)> ae A(u). 


Proof. We observe that the projection 2: U x EY >U is linear, and 
hence that its derivative at each point is constant, equal to the projection 
on the first factor. Our formula is then an immediate consequence of the 
definition. The local formula shows that 0@ is in fact a 1-form locally, and 
therefore globally since it has an invariant description. 


Our 1-form is called the canonical 1-form on the cotangent bundle. We 
define the canonical 2-form © on the cotangent bundle T’ X to be 


Q= —dé. 


The next proposition gives a local description of Q. 
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Proposition 7.2. Let U be open in E, and let Q be the local representa- 
tion of the canonical 2-form on TYU =U x EY. Let (x,A)eU x EY. 
Let (u,, w,) and (u,, w,) be elements of E x EY. Then 


(Qe. 4) (uy, Wi) X (Uz, W2)> = (uy, W2> — (un, Wy» 


= W2(u,) — Wy (U2). 


Proof. We observe that @ is linear, and thus that @’ is constant. We 
then apply the local formula for the exterior derivative, given in Proposi- 
tion 3.2. Our assertion becomes obvious. 


The canonical 2-form plays a fundamental role in Lagrangian and 
Hamiltonian mechanics, cf. [AbM 78], Chapter 3, §3. I have taken the 
sign of the canonical 2-form both so that its value is a 2 x 2 determi- 
nant, and so that it fits with, for instance, [LoS 68] and [AbM 78]. 
We observe that © is closed, that is dQ =0, because Q = —d0é. Thus 
(T ’X,Q) is a symplectic manifold, to which the properties listed at the 
end of the last section apply. 

In particular, let € be a vector field on X. Then to € is associated a 
function called the momentum function 


fe: T’X +R suchthat = f-(A,) = 4, (E(x)) 


for A,¢ T.’X. Then df, is a 1-form on T’X. Classical Hamiltonian 
mechanics then applies Propositions 6.2 and 6.3 to this situation. We 
refer the interested reader to [LoS 68] and [AbM 78] for further infor- 
mation on this topic. For an important theorem of Marsden—Weinstein 
[MaW 74] and applications to vector bundles, see [Ko 87]. 


V, §8. DARBOUX’S THEOREM 


If E = R” then the usual scalar product establishes the self-duality of R”. 
This self-duality arises from other forms, and in this section we are 
especially interested in the self-duality arising from alternating forms. If 
E is finite dimensional and @ is an element of L?(E), that is an alternat- 
ing 2-form, which is non-singular, then one sees easily that the dimension 
of E is even. 


Example. An example of such a form on R" is the following. Let 


V = (Vy, ---5 Uns V15 +++ 5Un)s 


W = (Wy, ---5 Was Wr eWay 
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be elements of R", with components »,, v/, w;, wj. Letting 
n 
o(v, w) = z (vjw; — v;W,) 
= 


defines a non-singular 2-form w on R?". It is an exercise of linear 
algebra to prove that any non-singular 2-form on R2" is linearly isomor- 
phic to this particular one in the following sense. If 


f:EoF 


is a linear isomorphism between two finite dimensional spaces, then it 
induces an isomorphism 
f*: L2(F) > L2(E). 


We call forms @ on E and w on F linearly isomorphic if there exists a 
linear isomorphism f such that f*~ =a. Thus up to a linear isomor- 
phism, there is only one non-singular 2-form on R?". (For a proof, cf. for 
instance my book Algebra.) 

We are interested in the same question on a manifold locally. Let U 
be open in the Banach space E and let x) ¢ U. A 2-form 


wo: U > L2(E) 


is said to be non-singular if each form w(x) is non-singular. If & is a 
vector field on U, then wo € is a 1-form, whose value at (x, w) is given 


(co o €)(x)(w) = (x) (E(x), w). 
As a special case of Proposition 6.1, we have: 
Let w be a non-singular 2-form on an open set U in EK. The association 
Crr@ol 


is a linear isomorphism between the space of vector fields on U and the 
space of 1-forms on U. 


Let 
ow: U > L2(U) 


be a 2-form on an open set U in E. If there exists a local isomorphism f 
at a point x9 € U, say 


FO ARVS 
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and a 2-form w on V, such that f*#=q@ (or more accurately, w re- 
stricted to U,), then we say that w is locally isomorphic to W at Xo. 
Observe that in the case of an isomorphism we can take a direct image 
of forms, and we shall also write 


f,@ = y 
instead of w = f*. In other words, f, =(f~*)*. 


Example. On R2" we have the constant form of the previous example. 
In terms of local coordinates (x,,...,X,,Y1,---»Y,), this form has the 
local expression 


ox, y= Ye dey a dy. 


This 2-form will be called the standard 2-form on R2". 


The Darboux theorem states that any non-singular closed 2-form in 
R”" is locally isomorphic to the standard form, that is that in a suitable 
chart at a point, it has the standard expression of the above example. A 
technique to show that certain forms are isomorphic was used by Moser 
[Mo 65], who pointed out that his arguments also prove the classical 
Darboux theorem. Moser’s theorem will be given in Chapter XI, §6. 

Alan Weinstein observed that Moser’s proof applies to the infinite 
dimensional case, whose statement is as follows. 


Theorem 8.1 (Darboux Theorem). Let E be a self-dual Banach space. 
Let 
aw: U > L2(E) 


be a non-singular closed 2-form on an open set of E, and let x,€U. 
Then q is locally isomorphic at x9 to the constant form @(xo). 


Proof. Let @p = @(Xo), and let 
O, = Wo + t(@ — Wo), Ost<l. 
We wish to find a time-dependent vector field €, locally at 0 such that if 
« denotes its flow, then 


a*w, = Wo- 


Then the local isomorphism «a, satisfies the requirements of the theorem. 
By the Poincaré lemma, there exists a 1-form @ locally at 0 such that 


@ — Wy = dd, 
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and without loss of generality, we may assume that 6(x9) =0. We con- 
tend that the time-dependent vector field &,, such that 


Oo; 2 er = —6, 
has the desired property. Let « be its flow. If we shrink the domain of 
the vector field near x, sufficiently, and use the fact that 0(x,.) = 0, then 
we can use the local existence theorem (Proposition 1.1 of Chapter IV) to 


see that the flow can be integrated at least to t= 1 for all points x in 
this small domain. We shall now verify that 


d 
a ot Oe) =0. 


This will prove that a*@, is constant. Since we have atw@o = Wo because 
a(0, x) = x and D,«(0, x) = id, 
it will conclude the proof of the theorem. 
We compute locally. We use the local formula of Proposition 5.2, and 
formula LIE 1, which reduces to 


£0; = d(o, o b1), 


because dw, = 0. We find 


d d 
qt ov) = of 6 o,) + ot (L,,@,) 


dt 

= af (5, @, + d(a, 0 :)) 
dt 

= a*(w — Wy — dO) 

=0. 


This proves Darboux’s theorem. 


Remark 1. For the analogous uniqueness statement in the case of a 
non-singular symmetric form, see the Morse—Palais lemma of Chapter 
VI, §5. Compare also with Theorem 2.2 of Chapter XIII. 


Remark 2. The proof of the Poincaré lemma can also be cast in the 
above style. For instance, let ¢,(x) = tx be a retraction of a star shaped 
open set around 0. Let €, be the vector field whose flow is ¢,, and let w 
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d baa = OL = o* dC = dg* C 
ae o= ¢; é,0 a ¢; (33 o= ¢; é,0. 
Since ¢f@ = 0 and ¢, is the identity, we see that 
at** 


1 1 
0 0 


is exact, thus concluding a proof of Poincaré’s theorem. 
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CHAPTER VI 


The Theorem of Frobenius 


Having acquired the language of vector fields, we return to differential 
equations and give a generalization of the local existence theorem known 
as the Frobenius theorem, whose proof will be reduced to the standard 
case discussed in Chapter IV. We state the theorem in §1. Readers 
should note that one needs only to know the definition of the bracket of 
two vector fields in order to understand the proof. It is convenient to 
insert also a formulation in terms of differential forms, for which the 
reader needs to know the local definition of the exterior derivative. 
However, the condition involving differential forms is proved to be equiv- 
alent to the vector field condition at the very beginning, and does not 
reappear explicitly afterwards. 

We shall follow essentially the proof given by Dieudonné in his Foun- 
dations of Modern Analysis, allowing for the fact that we use freely the 
geometric language of vector bundles, which is easier to grasp. 

It is convenient to recall in §2 the statements concerning the existence 
theorems for differential equations depending on parameters. The proof 
of the Frobenius theorem proper is given in §3. An important applica- 
tion to Lie groups is given in §5, after formulating the theorem globally. 

The present chapter will not be used in the rest of this book. 


VI, §1. STATEMENT OF THE THEOREM 


Let X be a manifold of class C? (p= 2). A subbundle E of its tangent 
bundle will also be called a tangent subbundle over X. We contend that 
the following two conditions concerning such a subbundle are equivalent. 
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FR 1. For each point ze X and vector fields €, y at z (i.e. defined on 
an open neighborhood of z) which lie in E (ie. such that the 
image of each point of X under &€, n lies in E), the bracket [é, n] 
also lies in E. 


FR 2. For each point ze X and differential form @ of degree 1 at z 
which vanishes on E, the form dw vanishes on & x y whenever €, 
n are two vector fields at z which lie in E. 


The equivalence is essentially a triviality. Indeed, assume FR 1. Let w 
vanish on E. Then 


(do, € x n> = —<a, [6,n]> — n<@, > + &<@, 1). 


By assumption the right-hand side is 0 when evaluated at z. Conversely, 
assume FR 2. Let €, 1 be two vector fields at z lying in E. If [€, 4](z) is 
not in E, then we see immediately from a local product representation 
and the Hahn-Banach theorem that there exists a differential form @ of 
degree 1 defined on a neighborhood of z which is 0 on E, and non-zero 
on [é, 4](z), thereby contradicting the above formula. 

We shall now give a third condition equivalent to the above two, and 
actually, we shall not refer to FR 2 any more. We remark merely that in 
the finite dimensional case, it is easy to prove that when a differential 
form @ satisfies condition FR 2, then dw can be expressed locally in a 
neighborhood of each point as a finite sum 


da =) y, A a; 


where y; and w; are of degree 1 and each w; vanishes on E. We leave 
this as an exercise to the reader. 


Let E be a tangent subbundle over X. We shall say that E is 
integrable at a point x, if there exists a submanifold Y of X containing 
X 9 such that the tangent map of the inclusion 


piYox 


induces a VB-isomorphism of TY with the subbundle E restricted to Y. 
Equivalently, we could say that for each point ye Y, the tangent map 


induces a toplinear isomorphism of T,Y on E,. Note that our condition 
defining integrability is local at xy». We say that E is integrable if it is 
integrable at every point. 
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Using the functoriality of vector fields, and their relations under tan- 
gent maps and the bracket product, we see at once that if E is integrable, 
then it satisfies FR 1. Indeed, locally, vector fields having their values in 
E are related to vector fields over Y under the inclusion mapping. 

Frobenius’ theorem asserts the converse. 


Theorem 1.1. Let X be a manifold of class C? (p 2 2) and let E be a 
tangent subbundle over X. Then E is integrable if and only if E 
satisfies condition FR 1. 


The proof of Frobenius’ theorem will be carried out by analyzing the 
situation locally and reducing it to the standard theorem for ordinary 
differential equations. Thus we now analyze the condition FR 1 in terms 
of its local representation. 

Suppose that we work locally, over a product U x V of open subsets 
of Banach spaces E and F. Then the tangent bundle T(U x V) can be 
written in a natural way as a direct sum. Indeed, for each point (x, y) in 
U x V we have 


Ty, (U x V) = TU) x T,(V). 


One sees at once that the collection of fibers T,(U) x 0 (contained in 
T,(U) x T,(V)) forms a subbundle which will be denoted by T,(U x V) 
and will be called the first factor of the tangent bundle. One could 
define T(U x V) similarly, and 


T(U x V) = T,(U x V)® T,(U x V). 


A subbundle E of T(X) is integrable at a point ze X if and only if 
there exists an open neighborhood W of z and an isomorphism 


g:UxVo>W 
of a product onto W such that the composition of maps 
inc. TG 
T(U x V)—> T(U x V) > T(W) 


induces a VB-isomorphism of T,(U x V) onto E|W (over ¢). Denoting 
by , the map of U into W given by 9,(x) = @(x, y), we can also express 
the integrability condition by saying that Tg, should induce a toplinear 
isomorphism of E onto E,,,,,, for all (x, y) in U x V. We note that in 
terms of our local product structure, T.g, is nothing but the partial 
derivative D, p(x, y). 

Given a subbundle of T(X), and a point in the base space X, we know 
from the definition of a subbundle in terms of a local product decompo- 
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sition that we can find a product decomposition of an open neighborhood 
of this point, say U x V, such that the point has coordinates (x9, yo) and 
such that the subbundle can be written in the form of an exact sequence 


OSU x VRESU KV EXE 


with the map 
(Xo, Yo): E> Ex F 


equal to the canonical embedding of E on E x 0. For a point (x, y) in 
U x V the map f(x, y) has two components f,(x, y) and f,(x, y) into E 
and F respectively. Taking a suitable VB-automorphism of U x V x E if 
necessary, we may assume without loss of generality that f,(x, y) is the 
identity. We now write f(x, y) = f(x, y). Then 


f:.U x V>LE€E, F) 
is a morphism (of class C?~') which describes our subbundle completely. 
We shall interpret condition FR 1 in terms of the present situation. 
If 
é:Ux V>EXxF 
is the local representation of a vector field over U x V, we let €, and C2 
be its projections on E and F respectively. Then é lies in the image of f 


if and only if 
2(x, y) =e I(x, you (x, y) 


for all (x, y) in U x V, or in other words, if and only if € is of the form 


&(x, y) = (1 (x, y), I(x, y)oi(x, y)) 

for some morphism (of class C?~') 

€é:UxV-oE 
We shall also write the above condition symbolically, namely 
(1) € =(61, f° 61). 
If €, 1 are the local representations of vector fields over U x V, then 
the reader will verify at once from the local definition of the bracket 
(Proposition 1.3 of Chapter V) that [é, 1] lies in the image of f if and 
only if 


Df (x, y)* (x, y)-ni(x, y) = Df(x, y)- n(x, y)* o1(%, y) 
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or symbolically, 
(2) Df-o-ny = Df-n-¢,. 

We have now expressed all the hypotheses of Theorem 1.1 in terms of 
local data, and the heart of the proof will consist in proving the follow- 


ing result. 


Theorem 1.2. Let U, V be open subsets of Banach spaces E, F 
respectively. Let 


f:Ux VoL, F) 
be a C'-morphism (r = 1). Assume that if 
€,,4,:UxV—-E 
are two morphisms, and if we let 
C=(61,f°5:) and yn =(m1, f°) 
then relation (2) above is satisfied. Let (xo, yo) be a point of U x V. 


Then there exists open neighborhoods Uy), Vo of Xo, Yo respectively, 
contained in U, V, and a unique morphism a: Uy x Vo > V such that 


D, a(x, y) = f(x, a(x, y)) 
and a(X9, y) = y for all (x, y) in Uy x Vo. 
We shall prove Theorem 1.2 in §3. We now indicate how Theorem 1.1 


follows from it. We denote by a, the map «,(x) = a(x, y), viewed as a 
map of U, into V. Then our differential equation can be written 


Da,(x) = f(x, o,(x)). 


g:Uyx Wr7UxV 


We let 


be the map ¢(x, y) = (x, a,(x)). It is obvious that De(xo, yo) is a top- 
linear isomorphism, so that @ is a local isomorphism at (Xo, yo). Fur- 
thermore, for (u, v)€ E x F we have 


D, (x, y)-(u, v) = (u, Da,(x)-u) = (u, f(%, a(2))-u) 


which shows that our subbundle is integrable. 
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VI, §2. DIFFERENTIAL EQUATIONS DEPENDING 
ON A PARAMETER 


Proposition 2.1. Let U, V be open sets in Banach spaces E, F respec- 
tively. Let J be an open interval of R containing 0, and let 


g.JxUxV—oF 
be a morphism of class C' (r 21). Let (x9, ¥o) be a point in U x V. 


Then there exists open balls Jo, Uy, Vo centered at 0, Xo, Yo and 
contained J, U, V respectively, and a unique morphism of class C’ 


B: Jo X Up X WV 
such that B(0, x, y) = y and 
D, Bit, x, y) = g(t, x, Bt, x, y)) 
for all (t, x, y)E Jo xX Up x KH. 


Proof. This follows from the existence and uniqueness of local flows, 
by considering the ordinary vector field on U x V 


G:JxUxVo~EXxF 
given by G(t, x, y) = (0, g(t, x, y)). If B(t, x, y) is the local flow for G, 
then we let f(t, x, y) be the projection on the second factor of B(t, x, y). 
The reader will verify at once that B satisfies the desired conditions. The 
uniqueness is clear. 
Let us keep the initial condition y fixed, and write 


B(t, x) = BUt, x, y). 


From Chapter IV, §1, we obtain also the differential equation satisfied by 
B in its second variable: 


Proposition 2.2. Let notation be as in Proposition 2.1, and with y fixed, 
let B(t, x) = B(t, x, y). Then D,B(t, x) satisfies the differential equation 


D, D> Bit, x)-v = Dy g(t, x, Bt, x): v a D3g(t, x; B(t, x))* D3 B(t, x): v, 


for every veE. 
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Proof. Here again, we consider the vector field as in the proof of 
Proposition 2.1, and apply the formula for the differential equation 
satisfied by D,B as in Chapter IV, §1. 


Vi, §3. PROOF OF THE THEOREM 


In the application of Proposition 2.1 to the proof of Theorem 1.2, we 
take our morphism g to be 


g(t, Zz, y) = f(Xo + tz, y)z 


with z in a small ball E, around the origin in E, and y in V. It is 
convenient to make a translation, and without loss of generality we can 
assume that x) = 0 and yp = 0. From Proposition 2.1 we then obtain 


B: Jog X Eg x HV 


with initial condition (0, z, y) = y for all ze Ep, satisfying the differential 
equation 


D, Blt, z, y) = f(tz, Blt, z, y))>z. 


Making a change of variables of type t= as and z=a™'x for a small 
positive number a, we see at once that we may assume that J, contains 
1, provided we take E, sufficiently small. As we shall keep y fixed 
from now on, we omit it from the notation, and write f(t, z) instead of 
A(t, z, y). Then our differential equation is 


(3) D, B(t, 2) = f (tz, Bt, z)):z. 


We observe that if we knew the existence of « in the statement of 
Theorem 2, then letting f(t, z) = «(x9 + tz) would yield a solution of our 
differential equation. Thus the uniqueness of « follows. To prove its 
existence, we start with B and contend that the map 


a(x) = B(1, x) 


has the required properties for small |x|. To prove our contention it 
will suffice to prove that 


(4) Dz Bit, z) = tf (tz, Blt 2)) 
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because if that relation holds, then 


Da(x) = D,B(1, x) = f(x, BU, x) = f(x, a(x) 


which is precisely what we want. 
From Proposition 2.2, we obtain for any vector ve E, 


D,D,B(t, z)-v = tD, f(tz, B(t, z))-v-z 
+ Dz f(tz, B(t, z))- D2 B(t, z)-v-z + f(tz, Blt, z))-v. 


We now let k(t) = D,B(t, z):v — tf(tz, B(t, z)):v. Then one sees at once 
that k(0) = 0 and we contend that 


(5) Dk(t) = D,f(tz, Blt, z))-k(t)-z. 


We use the main hypothesis of our theorem, namely relation (2), in which 
we take €, and 7, to be the fields v and z respectively. We compute Df 
using the formula for the partial derivatives, and apply it to this special 
case. Then (5) follows immediately. It is a linear differential equation 
satisfied by k(t), and by Corollary 1.7 of Chapter IV, we know that the 
solution 0 is the unique solution. Thus k(t)=0 and relation (4) is 
proved. The theorem also. 


VI, §4. THE GLOBAL FORMULATION 


Let X be a manifold. Let F be a tangent subbundle. By an integral 
manifold for F, we shall mean an injective immersion 


f:YoxX 
such that at every point ye Y, the tangent map 
Tf: TY > Try X 


induces a toplinear isomorphism of T,Y on the subspace Fy. of Try)X. 
Thus Tf induces locally an isomorphism of the tangent bundle of Y with 
the bundle F over f(Y). 

Observe that the image /f(Y) itself may not be a submanifold of X. 
For instance, if F has dimension 1 (i.e. the fibers of F have dimension 1), 
an integral manifold for F is nothing but an integral curve from the 
theory of differential equations, and this curve may wind around X in 
such a way that its image is dense. A special case of this occurs if we 
consider the torus as the quotient of the plane by the subgroup generated 
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by the two unit vectors. A straight line with irrational slope in the plane 
gets mapped on a dense integral curve on the torus. 

If Y is a submanifold of X, then of course the inclusion j: Y X is an 
injective immersion, and in this case, the condition that it be an integral 
manifold for F simply means that T(Y) = F|Y (F restricted to Y). 

We now have the local uniqueness of integral manifolds, correspond- 
ing to the local uniqueness of integral curves. 


Theorem 4.1. Let Y, Z be integral submanifolds of X for the subbundle 
F of TX, passing through a point x9. Then there exists an open 
neighborhood U of xo in X, such that 


YnU=ZQOU. 


Proof. Let U be an open neighborhood of x, in X such that we have 
a chart 
U-~VxWw 
with 
Xot+(Yo> Wo), 


and Y corresponds to all points (y, Wo), y¢ V. In other words, Y corre- 
sponds to a factor in the product in the chart. If V is open in F, and W 
open in F,, with F, x F, = E, then the subbundle F is represented by the 
projection 

VxWxF, 


Vx W 


Shrinking Z, we may assume that ZcCU. Let h:Z—Vx W be the 
restriction of the chart to Z, and let h=(h,,h,) be represented by its 
two components. By assumption, h’(x) maps E into F, for every x € Z. 
Hence h, is constant, so that h(Z) is contained in the factor V x {wo}. It 
follows at once that h(Z)=V, x {wo} for some open V, in V, and we 
can shrink U to a product V, x W, (where W, is a small open set in W 
containing w,) to conclude the proof. 


We wish to get a maximal connected integral manifold for an inte- 
grable subbundle F of TX passing through a given point, just as we 
obtained a maximal integral curve. For this, it is just as easy to deal 
with the nonconnected case, following Chevalley’s treatment in his book 
on Lie Groups. (Note the historical curiosity that vector bundles were 
invented about a year after Chevalley published his book, so that the 
language of vector bundles, or the tangent bundle, is absent from 
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Chevalley’s presentation. In fact, Chevalley used: a terminology which 
now appears terribly confusing for the notion of a tangent subbundle, 
and it will not be repeated here!) 

We give a new manifold structure to X, depending on the integrable 
tangent subbundle F, and the manifold thus obtained will be denoted by 
X,;. This manifold has the same set of points as X. Let xe X. We 
know from the local uniqueness theorem that a submanifold Y of X 
which is at the same time an integral manifold for F is locally uniquely 
determined. A chart for this submanifold locally at x is taken to be a 
chart for X;. It is immediately verified that the collection of such charts 
is an atlas, which defines our manifold X,;. (We lose one order of 
differentiability.) The identity mapping 


ji xXpoXx 


is then obviously an injective immersion, satisfying the following univer- 
sal properties. 


Theorem 4.2. Let F be an integrable tangent subbundle over X. If 
f:Y>Xx 


is a morphism such that Tf: TY-- TX maps TY into F, then the induced 
map 
tp YoX, 


(same values as f but viewed as a map into the new manifold X,) is also 
a morphism. Furthermore, if f is an injective immersion, then fy induces 
an isomorphism of Y onto an open subset of Xp. 


Proof. Using the local product structure as in the proof of the local 
uniqueness Theorem 4.1, we see at once that f, is a morphism. In 
other words, locally, f maps a neighborhood of each point of Y into a 
submanifold of X which is tangent to F. If in addition f is an injective 
immersion, then from the definition of the charts on X,;, we see that f; 
maps Y bijectively onto an open subset of X;, and is a local isomor- 
phism at each point. Hence f,; induces an isomorphism of Y with an 
open subset of X;, as was to be shown. 


Corollary 4.3. Let X;(x 9) be the connected component of X, containing 
a point X9. If f: YX is an integral manifold for F passing through 


Xg, and Y is connected, then there exists a unique morphism 


h: Yo X p(X) 
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making the following diagram commutative: 


h 
Y — X,(X%o) 
f j 
X 


and h induces an isomorphism of Y onto an open subset of X>(Xo). 

Proof. Clear from the preceding discussion. 

Note the general functorial behavior of the integral manifold. If 
g:X > xX’ 


is an isomorphism, and F is an integrable tangent subbundle over X, 
then F’ = (Tg)(F) = 9,¥F is an integrable bundle over X’. Then the fol- 
lowing diagram is commutative: 


5 ee eo 


| 


xX — XxX’ 
g 


The map g, is, of course, the map having the same values as g, but 
viewed as a map on the manifold Xf. 


VI, §5. LIE GROUPS AND SUBGROUPS 


It is not our purpose here to delve extensively into Lie groups, but to 
lay the groundwork for their theory. For more results, we refer the 
reader to texts on Lie groups, differential geometry, and also to the paper 
by W. Graeub [Gr 61]. Although seemingly written to apply only to the 
finite dimensional case, this paper holds essentially in its entirety for the 
Banach case (and Hilbert case when dealing with Riemannian metrics), 
and is written on foundations corresponding to those of the present 
book. 

By a group manifold, or a Lie group G, we mean a manifold with a 
group structure, that is a law of composition and inverse, 


t!1GxG-G and G->G 
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which are morphisms. Thus each x € G gives rise to a left translation 


™:G—>G 
such that t*(y) = xy. 

When dealing with groups, we shall have to distinguish between iso- 
morphisms in the category of manifolds, and isomorphisms in the cate- 
gory of group manifolds, which are also group homomorphisms. Thus 
we shall use prefixes, and speak of group manifold isomorphism, or 
manifold isomorphism as the case may be. We abbreviate these by GM- 
isomorphism or M-isomorphism. We see that left translation is an M- 
isomorphism, but not a GM-isomorphism. 

Let e denote the origin (unit element) of G. If ve TG is a tangent 
vector at the origin, then we can translate it, and we obtain a map 


(x, vo tyv = C,(x) 
which is easily verified to be a VB-isomorphism 
Gx 7;G>TG 


from the product bundle to the tangent bundle of G. This is done at 
once using charts. Recall that 7;G can be viewed as a Banachable space, 
using any local trivialization of G at e to get a toplinear isomorphism of 
T,G with the standard Banachable space on which G is modeled. Thus 
we see that the tangent bundle of a Lie group is trivializable. 

A vector field € over G is called left invariant if t%¢ = € for all x eG. 
Note that the map 


xt €,(x) 
described above is a left invariant vector field, and that the association 
DNC, 
obviously establishes a linear isomorphism between 7,;G and the vector 
space of left invariant vector fields on G. The space of such vector fields 
will be denoted by g or [(G), and will be called the Lie algebra of G, 


because of the following result. 


Proposition 5.1. Let &, be left invariant vector fields on G. Then 
[é, n] is also left invariant. 


Proof. This follows from the general functorial formula 


T¥L¢, 1] = [r¢6, ten] = LE, 1). 
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Under the linear isomorphism of T,G with I(G), we can view I(G) as a 
Banachable space. By a Lie subalgebra of [(G) we shall mean a closed 
subspace h which splits, and having the property that if ¢, 7 eb, then 
[é, 7] €b also. 


Note. In the finite dimensional case, every subspace is closed and 
splits, so that only this last condition about the bracket product need be 
mentioned explicitly. 


Let G, H be Lie groups. A map 
f:H-G 
will be called a homomorphism if it is a group homomorphism and a 


morphism in the category of manifolds. Such a homomorphism induces a 
continuous linear map 


and it is clear that it also induces a corresponding linear map 
I(H) > I(G), 


also denoted by f,. Namely, if ve 7,H and €, is the left invariant vector 
field on H induced by »v, then 


TaGe = Sie 


The general functorial property of related vector fields applies to this 
case, and shows that the induced map 


f,: (UH) > (6) 
is also a Lie algebra homomorphism, namely for &, 4 € I(H) we have 


fil E; n] a LAS; fn]. 


Now suppose that the homomorphism f: H > G is also an immersion 
at the origin of H. Then by translation, one sees that it is an immersion 
at every point. If in addition it is an injective immersion, then we shall 
say that f is a Lie subgroup of G. We see that in this case, f induces a 
splitting injection 

Sf (CA) > 1(G). 


The image of I(H) and [(G) is a Lie subalgebra of I(G). 
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In general, let h be a Lie subalgebra of I(G) and let F, be the corre- 
sponding subspace of 7,G. For each x é€ G, let 


F, = F,. 


Then F, is a split subspace of T,G, and using local charts, it is clear that 
the collection F = {F,} is a subbundle of TG, which is left invariant. 
Furthermore, if 

f:H-G 


is a homomorphism which is an injective immersion, and if h is the 
image of I(H), then we also see that f is an integral manifold for the 
subbundle F. We shall now see that the converse holds, using Frobenius’ 
theorem. 


Theorem 5.2. Let G be a Lie group, h a Lie subalgebra of \(G), and let 
F be the corresponding left invariant subbundle of TG. Then F is 
integrable. 


Proof. I owe the proof to Alan Weinstein. It is based on the follow- 
ing lemma. 


Lemma 5.3. Let X be a manifold, let €, n be vector fields at a point Xo, 
and let F be a subbundle of TX. If €(x9) =0 and € is contained in F, 
then (6, n] (Xo) € F. 


Proof. We can deal with the local representations, such that X = U is 
open in E, and F corresponds to a factor, that is 


TX =UxF,xF, and F=U xF,. 


We may also assume without loss of generality that x» = 0. Then €(0) = 
0, and €: U + F, may be viewed as a map into F,. We may write 


E(x) = A(x)x, 


with a morphism A: U > L(E, F,). Indeed, 
1 
¢(x) = | ¢'(tx) dt x, 
0 


1 
and A(x) = pr, ° | &é'(tx) dt, where pr, is the projection on F,. Then 
0 


LE, 2] (x) = n'(x)E(x) — o'(x) n(x) 
= 1'(x)A(x)x — A'(x):x-n(x) — A(x) 0), 
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whence 


[¢, 1] (0) = A(0)n(0). 
Since A(0) maps E into F,, we have proved our lemma. 


Back to the proof of the proposition. Let & 4 be vector fields at a 
point x, in G, both contained in the invariant subbundle F. There exist 
invariant vector fields €) and 49 at x9 such that 


(Xo) = €0(Xo) and (Xo) = No(Xo). 
Let 


Cpt £6 and m1 ="—No.- 


Then ¢,, 7, vanish at xq and lie in F. We get: 
[é, n] = y [é;, nl. 
tJ 


The proposition now follows at once from the lemma. 


Theorem 5.4. Let G be a Lie group, let h be a Lie subalgebra of \(G), 
and let F be its associated invariant subbundle. Let 


jH-G 


be the maximal connected integral manifold of F passing through e. 
Then H is a subgroup of G, and j: H >G is a Lie subgroup of G. The 
association between h and j: H >G establishes a bijection between Lie 
subalgebras of \(G) and Lie subgroups of G. 


Proof. Let x€H. The M-isomorphism t* induces a VB-isomorphism 
of F onto itself, in other words, F is invariant under tx. Furthermore, 
since H passes through e, and xe lies in H, it follows that j: HG is 
also the maximal connected integral manifold of F passing through x. 
Hence x maps H onto itself. From this we conclude that if ye H, then 
xy € H, and there exists some ye H such that xy =e, whence x! eH. 
Hence H is a subgroup. The other assertions are then clear. 

If H is a Lie subgroup of G, belonging to the Lie algebra b, and F is 
the associated integrable left invariant tangent subbundle, then the inte- 
gral manifold for F passing through a given point x is simply the transla- 
tion xH, as one sees from first functorial principles. 

When b is 1-dimensional, then it is easy to see that the Lie subgroup 
is in fact a homomorphic image of an integral curve 


a:R-G 
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which is a homomorphism, and such that «’(0) =v is any vector in T,G 
which is the value at e of a non-zero element of h. Changing this vector 
merely reparametrizes the curve. The integral curve may coincide with 
the subgroup, or it comes back on itself, and then the subgroup is 
essentially a circle. Thus the integral curve need not be equal to the 
subgroup. However, locally near t = 0, they do coincide. Such an inte- 
gral curve is called a one-parameter subgroup of G. 

Using Theorem 1.5 of Chapter V, it is then easy to see that if the Lie 
algebra of a connected Lie group G is commutative, then G itself is 
commutative. One first proves this for elements in a neighborhood of 
the origin, using 1-parameter subgroups, and then one gets the statement 
globally by expressing G as a union of products 


UU::-U, 


where U is a symmetric connected open neighborhood of the unit ele- 
ment. All of these statements are easy to prove, and belong to the first 
chapter of a book on Lie groups. Our purpose here is merely to lay the 
general foundations essentially belonging to general manifold theory. 


Warning. The group of differential automorphisms of a finite dimen- 
sional manifold is “infinite dimensional” but usually not a Lie group, 
because multiplication is usually continuous only in each variable sepa- 
rately. For an analysis of this, also in the context of H? (Sobolev) spaces, 
cf. Ebin and Marsden [EbM 70]. 


CHAPTER VII 


Metrics 


In our discussion of vector bundles, we put no greater structure on the 
fibers than that of topological vector space (of the same category as those 
used to build up manifolds). One can strengthen the notion so as to 
include the metric structure, and we are thus led to consider Hilbert 
bundles, whose fibers are Hilbert spaces. 

Aside from the definitions, and basic properties, we deal with two 
special topics. On the one hand, we complete our uniqueness theorem 
on tubular neighborhoods by showing that when a Riemannian metric is 
given, a tubular neighborhood can be straightened out to a metric one. 
Secondly, we show how a Riemannian metric gives rise in a natural way 
to a spray, and thus how one recovers geodesics. The fundamental 2- 
form is used to identify the vector fields and 1-forms on the tangent 
bundle, identified with the cotangent bundle by the Riemannian metric. 

We assume throughout that our manifolds are Hausdorff and are 
sufficiently differentiable so that all our statements make sense. (For in- 
stance, when dealing with sprays, we take p = 3.) 

Of necessity, we shall use the standard spectral theorem for (bounded) 
symmetric operators. A self-contained treatment will be given in the 
appendix. 


Vil, §1. DEFINITION AND FUNCTORIALITY 


For Riemannian geometry, we shall deal with a Hilbertable vector space, 
that is a topological vector space which is complete, and whose topology 
can be defined by the norm associated with a bilinear form, which is 
symmetric and positive definite. All facts needed in the sequel concern- 
ing Hilbert spaces can be found in the Appendix. 


170 METRICS [VII, §1] 


It turns out that some basic properties have only to do with a weaker 
property of the space E on which a manifold is modeled, namely that 
the Banach space E is self dual, via a symmetric non-singular bilinear 
form. Thus we only assume this property until more is needed. We 
recall that such a form is a continuous bilinear map 


(v, w) HH <v, w> of ExE>R 


such that <v, w> = <w, v) for all v, we E, and the corresponding map of 
E into the dual space L(E) is a topological isomorphism. 


Examples. Of course, the standard positive definite scalar product on 
Euclidean space provides the easiest (in some sense) example of a self 
dual vector space. But the physicists are interested in R* with the scalar 
product such that the square of a vector (x, y, z,t) is x? + y? + 2? —12?. 
This scalar product is non-singular. For one among many nice applica- 
tions of the indefinite case, cf. for instance [He 84] and [Gu 91], dealing 
with Huygens’ principle. 


We consider L?,,,(E), the vector space of continuous bilinear forms 


ALEX EOR 


which are symmetric. If x is fixed in E, then the continuous linear form 
4,(y) = A(x, y) is represented by an element of E which we denote by Ax, 
where A is a continuous linear map of E into itself. The symmetry of 4 
implies that A is symmetric, that is we have 


A(x, y) = <Ax, y> = <x, Ay> 


for all x,y¢E. Conversely, given a symmetric continuous linear map 
A: E-E we can define a continuous bilinear form on E by this formula. 
Thus L2,,,(E) is in bijection with the set of such operators, and is itself a 
Banach space, the norm being the usual operator norm. Suppose E is a 
Hilbert space, and in particular, E is self dual. 

The subset of L2,,,(E) consisting of those forms corresponding to sym- 
metric positive definite operators (by definition such that A = el for some 
é>0) will be called the Riemannian of E and be denoted by Ri(E). 
Forms / in Ri(E) are called positive definite. The associated operator A 
of such a form is invertible, because its spectrum does not contain 0 and 
the continuous function 1/t is invertible on the spectrum. 

In general, suppose only that E is self dual. The space L?,,(E) con- 
tains as an open subset the set of non-singular symmetric bilinear forms, 
which we denote by Met(E), and which we call the set of metrics or 
pseudo Riemannian metrics. In view of the operations on vector bundles 


(Chapter III, §4) we can apply the functor L3,,, to any bundle whose 
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fibers are self dual. Thus if z: E- X is such a bundle, then we can form 
L2m(™). A section of L2,,(z) will be called by definition a symmetric 
bilinear form on z. A (pseudo Riemannian) metric on x (or on E) is 
defined to be a symmetric bilinear form on 2, whose image lies in the 
open set of metrics at each point. We let Met(z) be the set of metrics on 
m, which we also call the set of metrics on E, and may denote by Met(E). 

If E is a Hilbert space and the image of the section of L2,,,(m) lies in 
the Riemannian space Ri(z,,) at each point x, in order words, if on the 
fiber at each point the non-singular symmetric bilinear form is actually 
positive definite, then we call the metric Riemannian. Let us denote a 
metric by g, so that g(x) e Met(E,) for each xe X, and lies in Ri(E,) if 
the metric is Riemanian. Then g(x) is a non-singular symmetric bilinear 
form in general, and in the Riemannian case, it is positive definite in 
addition. 

A pair (X,g) consisting of a manifold X and a (pseudo Riemannian) 
metric g will be called a pseudo Riemannian manifold. It will be called a 
Riemannian manifold if the manifold is modeled on a Hilbert space, and 
the metric is Riemannian. 

Observe that the sections of L2,,,(m) form a vector space (abstract) but 
that the Riemannian metrics do not. They form a convex cone. Indeed, 
if a, b> 0 and g,, g, are two Riemannian metrics, then ag, + bg, is also 
a Riemannian metric. 

Suppose we are given a VB-trivialization of x over an open subset U 
of X, say 


tia (U)>3U x E. 


We can transport a given pseudo Riemannian metric g (or rather its 
restriction to x 1(U)) to U x E. In the local representation, this means 
that for each xeU we can identify g(x) with a symmetric invertible 
operator A, giving rise to the metric. The operator A, is positive definite 
in the Riemannian case. Furthermore, the map 


xPRA 


x 


from U into the Banach space L(E, E) is a morphism. 

As a matter of notation, we sometimes write g, instead of g(x). Thus 
if v, w are two vectors in E,, then g,(v,w) is a number, and is more 
convenient to write than g(x)(v,w). We shall also write <v, w), if the 
metric g is fixed once for ail. 


Proposition 1.1. Let X be a manifold admitting partitions of unity. Let 
nm: EX be a vector bundle whose fibers are Hilbertable vector spaces. 
Then x admits a Riemannian metric. 
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Proof. Find a partition of unity {U;, 9} such that 2|U, is trivial, that 
is such that we have a trivialization 


nj: 2 1(U,) > U; x E 


(working over a connected component of X, so that we may assume the 
fibers toplinearly isomorphic to a fixed Hilbert space E). We can then 
find a Riemannian metric on U, x E in a trivial way. By transport of 
structure, there exists a Riemannian metric g; on 2|U; and we let 


g= » PiGi- 
Then g is a Riemannian metric on x. 


Let us investigate the functorial behavior of metrics. 
Consider a VB-morphism 


f 


E’ — E 
| | 


Ss Ly 
fo 

with vector bundles E’ and E over X and Y respectively, whose fibers are 

self dual spaces. Let g be a symmetric bilinear form on 2, so that for 

each ye Y we have a continuous, bilinear, symmetric map 


g(y): E, x E,> R. 
Then the composite map 
E’. x El. > E, x E,>R 


with y = f(x) is a symmetric bilinear form on E', and one verifies imme- 
diately that it gives rise to such a form, on the vector bundle z’, which 
will be denoted by f*(g). Then f induces a map 


Liym(S) = f*: Ligm(™) > Liym(t). 


Furthermore, if f, is injective and splits for each x e€ X, and g is a metric 
(resp. g is a Riemannian metric in the Hilbert case), then obviously so is 
f*(g), and we can view f* as mapping Met(z) into Met(z’) (resp. Ri(z) 
into Ri(z’) in the Riemannian case). 

Let X be a manifold modeled on a Hilbertable space and let T(X) be 
its tangent bundle. By abuse of language, we call a metric on T(X) also 
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a metric on X and write Met(X) instead of Met(T(X)). Similarly, we 
write Ri(X) instead of Ri(T(X)). 
Let f: X — Y be an immersion. Then for each x € X, the linear map 


T.f: T(X) > Treo (Y) 
is injective, and splits, and thus we obtain a contravariant map 
f*: Ri(Y) > Ri(X), 


each Riemannian metric on Y inducing a Riemannian metric on X. 

A similar result applies in the pseudo Riemannian case. If (Y,g) is 
Riemannian, and f is merely of class C! but not necessarily an immer- 
sion, then the pull back f*(g) is not necessarily positive definite, but is 
merely what we call semipositive. In general, if (X, h) is pseudo Rieman- 
nian and h(v,v)20 for all ve T,X, all x, then (X,h) is called semi 
Riemannian. Thus the pull back of a semi Riemannian metric is semi 
Riemannian. 

For a major result concerning Riemannian embeddings of manifolds in 
Euclidean space, see Nash [Na 56], followed by Moser [Mo 61], as well 
as the exposition I gave in [La 61]. Even though dealing a priori with 
finite dimensional manifolds, the imbedding problem is essentially con- 
cerned with the infinite dimensional manifold of Riemannian metrics. 
The problem partly amounts to obtaining an inverse mapping theorem 
in a context more complicated than that of Banach spaces, namely Frechet 
spaces, when all C? norms intervene, for p = 1, 2, ..... Newton approxi- 
mation is used instead of the shrinking lemma to solve the local iso- 
morphism problem in this case. 


The next five sections will be devoted to considerations which apply 
specifically to the Riemannian case, where positivity plays a central role. 


Vil, §2. THE HILBERT GROUP 


Let E be a Hilbert space. The group of toplinear automorphisms Laut(E) 
contains the group Hilb(E) of Hilbert automorphisms, that is those top- 
linear automorphisms which preserve the inner product: 


<Av, Aw) = Xv, w> 


for all v, we E. We note that A is Hilbertian if and only if A*A =I. 

As usual, we say that a linear continuous map A: E> E is symmetric 
if A* = A and that it is skew-symmetric if A4* = —A. We have a direct 
sum decomposition of the Banach space L(E, E) in terms of the two 
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closed subspaces of symmetric and skew-symmetric operators: 
A = 3(A + A*) + 3(A — A*). 


We denote by Sym(E) and Sk(E) the Banach spaces of symmetric and 
skew-symmetric maps respectively. The word operator will always mean 
continuous linear map of E into itself. 


Proposition 2.1. For all operators A, the series 


A2 
exp(d4)= 1+ A+ 5 + 


converges. If A commutes with B, then 
exp(A + B) = exp(A) exp(B). 
For all operators sufficiently close to the identity I, the series 


log(4) =4—9 4 A= 9) Soe 


converges, and if A commutes with B, then 
log(AB) = log(A) + log(B). 
Proof. Standard. 


We leave it as an exercise to the reader to show that the exponential 
function gives a C”-morphism of L(E, E) into itself. Similarly, a function 
admitting a development in power series say around 0 can be applied to 
the set of operators whose bound is smaller than the radius of conver- 
gence of the series, and gives a C*-morphism. 


Proposition 2.2. If A is symmetric (resp. skew-symmetric), then exp(A) is 
symmetric positive definite (resp. Hilbertian). If A is toplinear auto- 
morphism sufficiently close to I and is positive definite symmetric (resp. 
Hilbertian), then log(A) is symmetric (resp. skew-symmetric). 


Proof. The proofs are straightforward. As an example, let us carry 
out the proof of the last statement. Suppose A is Hilbertian and suf- 
ficiently close to I. Then A*A = I and A* = A™*. Then 


log(A)* = oe J 


= log(A“*). 
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If A is close to I, so is A+, so that these statements make sense. We 
now conclude by noting that log(A~!) = —log(A). All the other proofs 
are carried out in a similar fashion, taking a star operator in series term 
by term, under conditions which insure convergence. 


The exponential and logarithm functions give inverse C® mappings 
between neighborhoods of 0 in L(E,E) and neighborhoods of IJ in 
Laut(E). Furthermore, the direct sum decomposition of L(E, E) into sym- 
metric and skew-symmetric subspaces is reflected locally in a neighbor- 
hood of I by a C® direct product decomposition into positive definite 
and Hilbertian automorphisms. This direct product decomposition can 
be translated multiplicatively to any toplinear automorphism, because if 
A € Laut(E) and B is close to A, then 


B= AA"B=A(I —(I — A™'B)) 
and (I — A™1B) is small. This proves: 


Proposition 2.3. The Hilbert group of automorphisms of E is a closed 
submanifold of Laut(E). 


In addition to this local result, we get a global one also: 


Proposition 2.4. The exponential map gives a C”-isomorphism from the 
space Sym(E) of symmetric endomorphisms of E and the space Pos(E) 
of symmetric positive definite automorphisms of E. 


Proof. We must construct its inverse, and for this we use the spectral 
theorem. Given A, symmetric positive definite, the analytic function log t 
is defined on the spectrum of A, and thus log A is symmetric. One 
verifies immediately that it is the inverse of the exponential function 
(which can be viewed in the same way). We can expand logt around a 
large positive number c, in a power series uniformly and absolutely 
convergent in an interval 0 <¢ <t < 2c — ¢, to achieve our purposes. 


Proposition 2.5. The manifold of toplinear automorphisms of the Hilbert 
space E is C®-isomorphic to the product of the Hilbert automorphisms 
and the positive definite symmetric automorphisms, under the mapping 


Hilb(E) x Pos(E) > Laut(E) 
given by 
(H, P) > HP. 


Proof. Our map is induced by a continuous bilinear map of 


L(E, E) x L(E, E) 
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into L(E, E) and so is C®. We must construct an inverse, or in other 
words express any given toplinear automorphism A in a unique way as a 
product A = HP where H is Hilbertian, P is symmetric positive definite, 
and both H, P depend C® on A. This is done as follows. First we note 
that A*A is symmetric positive definite (because <(A*Av, v> = <Ap, Av>, 
and furthermore, A*A is a toplinear automorphism, so that 0 cannot be 
in its spectrum, and hence A*A 2 eI >O since the spectrum is closed). 
We let 


P =(A*A)!2 
and let H = AP™!. Then H is Hilbertian, because 
H*H =(P7!)*A*AP™ = I. 


Both P and H depend differentiably on A since all constructions involved 
are differentiable. 

There remains to be shown that the expression as a product is unique. 
If A= H,P, where H,, P, are Hilbertian and symmetric positive definite 
respectively, then 


HH, = PP", 


and we get H,=PP,! for some Hilbertian automorphism H,. By 
definition, 
I = H*H, = (PP, 1)*PP,* 


and from the fact that P* = P and P* = P,, we find 


Taking the log, we find 2 log P = 2 log P,. We now divide by 2 and take 
the exponential, thus giving P = P, and finally H = H,. This proves our 
proposition. 


Vil, §3. REDUCTION TO THE HILBERT GROUP 


We define a new category of bundles, namely the Hilbert bundles over 
X, denoted by HB(X). As before, we would denote by HB(X,E) or 
HB(X, 2) those Hilbert bundles whose fiber is a Hilbert space E or lies 
in a category QQ. 

Let 2: E—X be a vector bundle over X, and assume that it has a 
trivialization {(U,, t;)} with trivializing maps 


tm 1(U,) > U, x E 
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where E is a Hilbert space, such that each toplinear automorphism 
(t;t;'), is a Hilbert automorphism. Equivalently, we could also say that 
1; is a Hilbert isomorphism. Such a trivialization will be called a Hilbert 
trivialization. Two such trivializations are called Hilbert-compatible if 
their union is again a Hilbert trivialization. An equivalence class of such 
compatible trivializations constitutes what we call a Hilbert bundle over 
X. Any such Hilbert bundle determines a unique vector bundle, simply 
by taking the VB-equivalence class determined by the trivialization. 

Given a Hilbert trivialization {(U;,1,)} of a vector bundle x over X, 
we can define on each fiber 2, a Hilbert space structure. Indeed, for each 
x we select an open set U; in which x lies, and then transport to 7, the 
scalar product in E by means of 1;,. By assumption, this is independent 
of the choice of U, in which x lies. Thus in a Hilbert bundle, we can 
assume that the fibers are Hilbert spaces, not only Hilbertable. 

It is perfectly possible that several distinct Hilbert bundles determine 
the same vector bundle. 

Any Hilbert bundle determining a given vector bundle x will be said 
to be a reduction of z to the Hilbert group. 

We can make Hilbert bundles into a category, if we take for the 
HB-morphisms the VB-morphisms which are injective and split at each 
point, and which preserve the metric, again at each point. 

Each reduction of a vector bundle to the Hilbert group determines a 
Riemannian metric on the bundle. Indeed, defining for each ze X and », 
we 7, the scalar product 


9x(0, w) = CTixD, TixW> 
with any Hilbert-trivializing map 1,, such that x e U;, we get a morphism 
XF>g, 


of X into the sections of L2,,(z) which are positive definite. We also 
have the converse. 


Theorem 3.1. Let 2 be a vector bundle over a manifold X, and assume 
that the fibers of x are all toplinearly isomorphic to a Hilbert space ¥. 
Then the above map, from reductions of x to the Hilbert group, into the 
Riemannian metrics, is a bijection. 


Proof. Suppose that we are given an ordinary VB-trivialization 
{(U,, t;)} of x. We must construct an HB-trivialization. For each i, let 9; 
be the Riemannian metric on U, x E transported from 2~'(U;) by means 
of t;. Then for each x € U;, we have a positive definite symmetric opera- 
tor A,, such that 


Gixl0, w) > (Aix 0, w> 
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for all v, we E. Let B,, be the square root of A;,. We define the 
trivialization o; by the formula 


Ox = BT ix 


and contend that {(U,,o;)} is a Hilbert trivialization. Indeed, from the 
definition of g;,, it suffices to verify that the VB-isomorphism 


BU; x E>U, x E 


given by B;, on each fiber, carries g; on the usual metric. But we have, 
for v, we E: 
(B;.0, Bw) = (Aix, w> 


since B,, is symmetric, and equal to the square root of A;,. This proves 
what we want. 


At this point, it is convenient to make an additional comment on 
normal bundles. 

Let a, B be two Hilbert bundles over the manifold X, and let f:«—- B 
be an HB-morphism. Assume that 


ee eee 


is exact. Then by using the Riemannian metric, there is a natural way of 
constructing a splitting for this sequence (cf. Chapter III, §5). 

Using Theorem 1.2 of the Appendix, we see at once that if F is a 
(closed) subspace of a Hilbert space, then E is the direct sum 


E=F@F+ 


of F and its orthogonal complement, consisting of all vectors perpendicu- 
lar to F. 

In our exact sequence, we may view f as an injection. For each x we 
let a} be the orthogonal complement of «, in B,. Then we shall find an 
exact sequence of VB-morphisms 


| eae 


whose kernel is a! (set theoretically). In this manner, the collection of 
orthogonal complements « can be given the structure of a Hilbert bun- 
dle. 

For each x we can write B,=a,@a , and we define h, to be the 
projection in this direct sum decomposition. This gives us a mapping 
h: B >a, and it will suffice to prove that h is a VB-morphism. In order 
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to do this, we may work locally. In that case, after taking suitable 
VB-automorphisms over a small open set U of X, we can assume that 
we deal with the following situation. 

Our vector bundle f is equal to U x E and «@ is equal to U x F for 
some subspace F of E, so that we can write E=F x F*. Our HB- 
morphism is then represented for each x by an injection f,: F > E: 


UMP 3 RE 


By the definition of exact sequences, we can find two VB-isomorphisms t 
and o such that the following diagram is commutative: 


Ux = URE 


Ux F — UxE 


and such that the bottom map is simply given by the ordinary inclusion 
of F in E. We can transport the Riemannian structure of the bundles on 
top to the bundles on the bottom by means of a”! and 1! respectively. 
We are therefore reduced to the situation where f is given by the simple 
inclusion, and the Riemannian metric on U x E is given by a family A, 
of symmetric positive definite operators on E (xe U). At each point x, 
we have <v, w>, = (A,v, w>. We observe that the map 


A:UxE>UxE 


given by A, on each fiber is a VB-automorphism of U x E. Let prg be 
the projection of Ux E on Ux F. It is a VB-morphism. Then the 
composite 

h=prpoA 


gives us a VB-morphism of U x E on U x F, and the sequence 
UxE ans UxF-0 


is exact. Finally, we note that the kernel of h consists precisely of the 
orthogonal complement of U x F in each fiber. This proves what we 
wanted. 


Vil, §4. HILBERTIAN TUBULAR NEIGHBORHOODS 


Let E be a Hilbert space. Then the open ball of radius 1 is isomorphic 
to E itself under the mapping 
v 
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the inverse mapping being 
te w 
wh>——__,, 
(1 + |w|?)*? 


If a> 0, then any ball of radius a is isomorphic to the unit ball under 
multiplication by the scalar a (or a™'). 

Let X be a manifold, and o: X —R a function (morphism) such that 
o(x) > 0 for all xe X. Let x: EX be a Hilbert bundle over X. We 
denote by E(o) the subset of E consisting of those vectors v such that, if 
v lies in E,,, then 


lvl, < a(x). 
Then E(c) is an open neighborhood of the zero section. 


Proposition 4.1. Let X be a manifold and x: E->X a Hilbert bundle. 
Let o: X +R be a morphism such that o(x)>0 for all x. Then the 
mapping 
o(nw)w 
"+ wR)? 


gives an isomorphism of E onto E(o). 


Proof. Obvious. The invese mapping is constructed in the obvious 
way. 


Corollary 4.2. Let X be a manifold admitting partitions of unity, and let 
am: E— X be a Hilbert bundle over X. Then E is compressible. 


Proof. Let Z be an open neighborhood of the zero section. For each 
x eX, there exists an open neighborhood V, and a number a, >0 such 
that the vectors in x ‘(V,) which are of length < a, lie in Z. We can 
find a partition of unity {(U,, g;)} on X such that each U, is contained in 
some V,;,. We let o be the function 


» Axi) Pi- 


Then E(o) is contained in Z, and our assertion follows from the proposi- 
tion. 


Proposition 4.3. Let X be a manifold. Let n: E— X and n,: E,—X be 
two Hilbert bundles over X. Let 


AEE, 
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be a VB-isomorphism. Then there exists an isotopy of VB-isomorphisms 
A E> E, 


with proper domain [0,1] such that 1, =A and A, is an HB-isomor- 
phism. 


Proof. We find reductions of E and E, to the Hilbert group, with 
Hilbert trivializations {(U,,1,;)} for E and {(U;,, p;)} for E,. We can then 
factor p;4t;' as in Proposition 2.5, applied to each fiber map: 


U, x E —— U,;x E —> U;xE 
Tj TG Pi 


nm *(U;) aa m(U;*) =a m'(U;) 


and obtain a factorization of 4 into A=A,Ap where Ay is a HB- 
isomorphism and Jp is a positive definite symmetric VB-automorphism. 
The latter form a convex set, and our isotopy is simply 


A, = Ay o (tI + (1 + t)Ap). 
(Smooth out the end points if you wish.) 


Theorem 4.4. Let X be a submanifold of Y. Let 1: E-— X and 1,:E, > X 
be two Hilbert bundles. Assume that E is compressible. Let f: E> Y 
and g:E,—Y be two tubular neighborhoods of X in Y. Then there 
exists an isotopy 


fi EY 


of tubular neighborhoods with proper domain [0,1] and there exists an 
HB-isomorphism yp: E> E, such that f, = f and fo = gu. 


Proof. From Theorem 6.2 of Chapter IV, we know already that there 
exists a VB-isomorphism / such that f ~ gd. Using the preceding propo- 
sition, we know that 4 = w where yu is a HB-isomorphism. Thus g/ = gu 
and by transitivity, f ~ yu, as was to be shown. 


Remark. In view of Proposition 4.1, we could of course replace the 
condition that E be compressible by the more useful condition (in prac- 
tice) that X admit partitions of unity. 
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Vil, §5. THE MORSE-PALAIS LEMMA 


Let U be an open set in some (real) Hilbert space E, and let f be a C?*? 
function on U, with p21. We say that x, is a critical point for f if 
Df(xo) = 0. We wish to investigate the behavior of f at a critical point. 
After translations, we can assume that x) =0 and that f(x,)=0. We 
observe that the second derivative D?f(0) is a continuous bilinear form 
on E. Let A= D?f(0), and for each x cE let i, be the functional such 
that yr A(x, y). If the map xd, is a toplinear isomorphism of E with 
its dual space EY, then we say that / is non-singular, and we say that the 
critical point is non-degenerate. 

We recall that a local C?-isomorphism @ at 0 is a C?-invertible map 
defined on an open set containing 0. 


Theorem 5.1. Let f be a C?*? function defined on an open neighbor- 
hood of 0 in the Hilbert space E, with p21. Assume that f(0) = 0, and 
that 0 is a non-degenerate critical point of f. Then there exists a local 
C?-isomorphism at 0, say @, and an invertible symmetric operator A such 
that 


f(x) = <Ag(x), (x)>. 
Proof. We may assume that U is a ball around 0. We have 
1 
f(x) = f(x) — f(0) = | Df(tx)x dt, 
0 
and applying the same formula to Df instead of f, we get 
1 1 
f(xy= | | D?f(stx)tx- x ds dt = g(x)(x, x) 
0 v0 


where 


g(x) = i |, D*f(stx)t ds dt. 


Then g is a C? map into the Banach space of continuous bilinear maps 
on E, and even the space of symmetric such maps. We know that this 
Banach space is toplinearly isomorphic to the space of symmetric opera- 
tors on E, and thus we can write 


F(x) = <A(x)x, x> 


where A: U > Sym(E) is a C? map of U into the space of symmetric 
operators on E. A straightforward computation shows that 


D*f(O)(v, w) = ¢A()», w). 
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Since we assumed that D7f(0) is non-singular, this means that A(0) is 

invertible, and hence A(x) is invertible for all x sufficiently near 0. 
Theorem 5.1 is then a consequence of the following result, which 

expresses locally the uniqueness of a non-singular symmetric form. 


Theorem 5.2. Let A: U + Sym(E) be a C? map of U into the open set 
of invertible symmetric operators on E. Then there exists a C? isomor- 
phism of an open subset U, containing 0, of the form 


p(x) = C(x)x, with a C? map C: U, > Laut(E) 
such that 
CA(x)x, x) = CAO) (x), (x)> = <A) C(X)x, C(x)x>. 
Proof. We seek a map C such that 
C(x)*A(0)C(x) = A(x). 


If we let B(x) = A(0)"1A(x), then B(x) is close to the identity J for small 
x. The square root function has a power series expansion near 1, which 
is a uniform limit of polynomials, and is C® on a neighborhood of I, and 
we can therefore take the square root of B(x), so that we let 


C(x) = B(x)*”. 


We contend that this C(x) does what we want. Indeed, since both A(0) 
and A(x) (or A(x)"') are self-adjoint, we find that 


B(x)* = A(x)A(0)*, 
whence 
B(x)*A(0) = A(0)B(x). 


But C(x) is a power series in I — B(x), and C(x)* is the same power 
series in I — B(x)*. The preceding relation holds if we replace B(x) by 
any power of B(x) (by induction), hence it holds if we replace B(x) by 
any polynomial in J — B(x), and hence finally, it holds if we replace B(x) 
by C(x), and thus 


C(x)*A(0)C(x) = A(0)C(x)C(x) = A(0) B(x) = A(x). 


which is the desired relation. 

All that remains to be shown is that @ is a local C?-isomorphism at 0. 
But one verifies that in fact, Dp(0) = C(0), so that what we need follows 
from the inverse mapping theorem. This concludes the proof of Theo- 
rems 5.1 and 5.2. 


184 METRICS (VII, §6] 


Corollary 5.3. Let f be a C?*? function near 0 on the Hilbert space E, 
such that 0 is a non-degenerate critical point. Then there exists a local 
C?-isomorphism wW at 0, and an orthogonal decomposition E = F + F?, 
such that if we write w(x) = y + z with ye F and zeF+, then 


f(W(x)) = <y, yd — «2, 2). 


Proof. On a space where A is positive definite, we can always make 
the toplinear isomorphism x+> A"?x to get the quadratic form to become 
the given hermitian product ¢ , >, and similarly on a space where A is 
negative definite. In general, we use the spectral theorem to decompose 
E into a direct orthogonal sum such that the restriction of A to the 
factors is positive definite and negative definite respectively. 


Note. The Morse—Palais lemma was proved originally by Morse in 
the finite dimensional case, using the Gram—Schmidt orthogonalization 
process. The elegant generalization and its proof in the Hilbert space 
case is due to Palais [Pa 69]. It shows (in the language of coordinate 
systems) that a function near a critical point can be expressed as a 
quadratic form after a suitable change of coordinate system (satisfying 
requirements of differentiability). It comes up naturally in the calculus 
of variations. For instance, one considers a space of paths (of various 
smoothness) a: [a, b] + E where E is a Hilbert space. One then defines a 
length function (see next section) or the energy function 


b 
f(o) = | <a'(t), o'(t)> dt, 


and one investigates the critical points of this function, especially its 
minimum values. These turn out to be the solutions of the variational 
problem, by definition of what one means by a variational problem. 
Even if E is finite dimensional, so a Euclidean space, the space of paths 
is infinite dimensional. Cf. [Mi 63] and [Pa 63]. 


Vil, §6. THE RIEMANNIAN DISTANCE 
Let (X, g) be a Riemannian manifold. For each C! curve 


y: [a,b] > xX 
we define its length 


b b 
L,(y) = L(y) = | <y'(0), yO}? dt = | lly'Oll, at. 


[VII, §6] THE RIEMANNIAN DISTANCE 185 


The norm is the one associated with the positive definite scalar product, 
ic. the Hilbert space norm at each point. We can extend the length to 
piecewise C' paths by taking the sum over the C’ curves constituting the 
path. We assume that X is connected, which is equivalent to the property 
that any two points can be joined by a piecewise C’ path. (If X is 
connected, then the set of points which can be joined to a given point Xo 
by a piecewise C! path is immediately verified to be open and closed, so 
equal to X. The converse, that pathwise connectedness implies con- 
nectedness, is even more obvious.) 


We define the g-distance on X for any two points x, ye X by: 


dist,(x, y) = greatest lower bound of L(y) for paths y in X 
joining x and y. 


When g is fixed throughout, we may omit g from the notation and write 
simply dist(x, y). It is clear that dist, is a semidistance, namely it is 
symmetric in (x, y) and satisfies the triangle inequality. To prove that it 
is a distance, we have to show that if x # y then dist,(x, y)>0. In a 
chart, there is a neighborhood U of x which contains a closed ball B(x, r) 
with r > 0, and such that y lies outside this closed ball. Then any path 
between x and y has to cross the sphere S(x,r). Here we are using the 
Hilbert space norm in the chart. We can also take r so small that the 
norm in the chart is given by 


Xv, W a(x) = Xv, A(x)w>, 


for v, weK, and x+>A(x) is a morphism from U into the set of 
invertible symmetric positive definite operators, such that there exist a 
number C, > 0 for which 


A(x)=C,I for all x € B(x, r). 


We then claim that there exists a constant C > 0 depending only on r, 
such that for any piecewise C' path y between x and a point on the 
sphere S(x, r) we have 

L(y) 2 Cr. 


This will prove that dist,(x, y) 2 Cr > 0, and will conclude the proof that 
dist, is a distance. 

By breaking up the path into a sum of C' curves, we may assume 
without loss of generality that our path is such a curve. Furthermore, we 
may take the interval [a, b] on which y is defined to be such that (b) is 
the first point such that »(t) lies on S(x,r), and otherwise y(t) € B(x, 7) 
for te[a,b]. Let y(b)=ru, where u is a unit vector. Write E as an 
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orthogonal direct sum 
E=Ru_1F, 


where F is a closed subspace. Then y(t) = s(t)u + w(t) with |s(t)| <r, 
s(a) = 0, s(b) =r and w(t)e F. Then 


b b 
L(y) = | Iy'(Oll, at = | <Q, A(r()y'(O>*? at 
b 
2 C;? | <7'(0, Y(O)?? at 


b 
= C}? | |s’(t)| dt by Pythagoras 


> cir 
as was to be shown. 


In addition, the above local argument also proves: 
Proposition 6.1. The distance dist, defines the given topology on X. 
Equivalently, a sequence {x,} in X converges to a point x in the given 
topology if and only if dist,(x,, x) converges to 0. 
We conclude this section with some remarks on reparametrization. 
Let 
y: [a,b] > xX 


be a piecewise C' path in X. To reparametrize y, we may do so on each 
subinterval where y is actually C1, so assume y is C!. Let 


: [c,d] > [a, b] 


be a C' map such that g(c)=a and g(d)=b. Then yo@ is C', and is 
called a reparametrization of y. The chain rule shows that 


L(y og) = L(y). 
Define the function s: [a,b] > R by 
s() = | Iy@ll,4t, so s(b)=L= L(y). 


Then s is monotone and s(a) = 0, while s(b) = L(y). Suppose that there is 
only a finite number of values t € [a, b] such that »’(t)= 0. We may then 
break up [a, b] into subintervals where y'(t) 4 0 except at the end points 
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of the subintervals. Consider each subinterval separately, and say 
a<a,<b,<b 


with y'(t) #0 for t €(a,, b,). Let s(a,) be the length of the curve over the 
interval [a, a,]. Define 


t 
I’Ollpdt for a, St Shy. 


s(t) = s(a,) + | 


a 


Then s is strictly increasing, and therefore the inverse function t = ¢(s) is 
defined over the interval. Thus we can reparametrize the curve by the 
variable s over the interval a, <t <b,, with the variable s satisfying 


s(a,) Ss S s(b,). 

Thus the whole path y on [a, b] is reparametrized by another path 
yoo: [0,L] ~X 

via a piecewise map f: [0, L] > [a, b], such that 


Iveoe@l,=1 and oly° @) =s. 


We now define a path y:[a,b]— X to be parametrized by arc length if 
\ly’@ll, = 1 for all te [a,b]. We see that starting with any path y, with 
the condition that there is only a finite number of points where y’(t) = 0 
for convenience, there is a reparametrization of the path by arc length. 

Let f: YX be a C? map with p=1. We shall deal with several 
notions of isomorphisms in different categories, so in the C? category, 
we may call f a differential morphism. Suppose (X,g) and (Y,h) are 
Riemannian manifolds. We say that f is an isometry, or a differential 
metric isomorphism if f is a differential isomorphism and f*(g)=h. If f 
is an isometry, then it is immediate that f preserves distances, i.e. that 


dist,(f(y1), f(¥2)) = dist,(y1,y2) for all y,, yr EY. 


Note that there is another circumstance of interest with somewhat weaker 
conditions when f: Y— X is an immersion, so induces an injection df(y): 
T,Y > TX for every ye Y¥, and we can speak of f being a metric 
immersion if f*(g) =h. It may even happen that f is a local differential 
isomorphism at each point of y, as for instance if f is covering map. In 
such a case, f may be a local isometry, but not a global one, whereby f 
may not preserve distances on all of Y, possibly because two points 
y, #Y2 may have the same image f(y,) = f(y2). 
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Vil, §7. THE CANONICAL SPRAY 


We now come back to the pseudo Riemannian case. 

Let X be a pseudo Riemannian manifold, modeled on the self dual 
space E. The scalar product < , > in E identifies E with its dual EY. 
The metric on X gives a toplinear isomorphism of each tangent space 
T,(X) with T,’(X). If we work locally with X =U open in E and we 
make the identification 


T(UU)=UxE and TY(U)=U x EY & T(U) 
then the metric gives a VB-isomorphism 
h: T(U) > T(U) 
by means of a morphism 
g: U > L(E, E) 
such that h(x, v) = (x, g(x)v). (In the finite dimensional case, with respect 
to an orthonormal basis, g(x) is represented by a symmetric matrix 
(gi;(x)), so the notation here fits what’s in other books with their g,;.) The 
scalar product of the metric at each point x is then given by the formula 
Xv, W>, = Cv, g(x)w> = Cg(x)v,w> for v, we E. 


For each x ¢ U we note that g’(x) maps E into L(E, E). For xe U and 
u, v€ E we write 


(9'(x)u)(v) = g'(x)u-v = g'(x)(u, v). 


From the symmetry of g, differentiating the symmetry relation of the 
scalar product, we find that for all u, v, we E, 


<g'(x)u-w, vy = Cg’(x)urv, w>. 
So we can interchange the last two arguments in the scalar product 
without changing the value. 


Observe that locally, the tangent linear map 


T(h): T(T(U)) > T(T(U)) 
is then given by 


T(h): (x, v, Uy, Uy) (x, g(x)», uy, g'(x)u, Ut g(x)uz). 
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If we pull back the canonical 2-form described in Proposition 7.2 of 
Chapter V from T‘(U) x T(U) to T(U) by means of h then its descrip- 
tion locally can be written on U x E in the following manner. 


QQ) Qa) (u,, U2) X (Wy, W2)> 
= (Uz, g(x)W2> — (Uz, g(x)W1> — <g’(x)uyv, W> + <g'(X) Wy °B, Uy >. 
From the simple formula giving our fundamental 2-form on the cotan- 
gent bundle in Chapter V, we see at once that it is nonsingular on T(U). 
Since h is a VB-isomorphism, it follows that the pull-back of this 2-form 
to the tangent bundle is also non-singular. 

We shall now apply the results of the preceding section. To do so, we 
construct a 1-form on T(X). Indeed, we have a function (kinetic energy!) 
K: T(x) ->R 
given by K(v) = 4<v, v>, if v is in T,. Then dK is a 1-form. By Proposi- 


tion 6.1 of Chapter V, it corresponds to a vector field on T(X), and we 
contend: 


Theorem 7.1. The vector field F on T(X) corresponding to —dK under 
the fundamental 2-form is a spray over X, called the canonical spray. 


Proof. We work locally. We take U open in E and have the double 
tangent bundle 


(U x E) x (Ex E) 


UxE 


Our function K can be written 
K(x, v) = 3X0, v>, = 3X2, g(x)v), 
and dK at a point (x, v) is simply the ordinary derivative 
DK(x, v): E x EOR. 


The derivative DK is completely described by the two partial derivatives, 
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and we have 
DK(x, v):(w,, W2) = D, K(x, v)-w, + D, K(x, v)- wy. 
From the definition of derivative, we find 
D, K(x, v):w, = 4<v, g’(x)w, > 
D,K (x, v)" W2 = (Wz, g(x)v> = Cv, g(x) W2 >. 


We use the notation of Proposition 3.2 of Chapter IV. We can represent 
the vector field F corresponding to dK under the canonical 2-form Q 
by a morphism f: U x E>E x E, which we write in terms of its two 
components: 


f(x, v) _ (At, v), fox, v)) oa (u, ? U2). 
Then by definition: 


(2) CQ») (fi, v), Sal, v)) x (wy > W2)> = <DK(x, v), (wy > W2)> 
= D, K(x, v)-w, + <2», g(x)w2 >. 


Comparing expressions (1) to (2), we find that as functions of w, they 
have only one term on the right side depending on w,. From the 
equality of the two expressions, we conclude that 


fix, v), g(x)W2 > = Cv, g(x)W2 > 


for all w,, and hence that f(x, v) =v, whence our vector field F is a 
second order vector field on X. 

Again we compare expression (1) and (2), using the fact just proved 
that u, = f,(x, v) =v. Setting the right sides of the two expressions equal 
to each other, and using u, = f,(u, v), we obtain: 


Proposition 7.2. In the chart U, let f =(f,, f,.): U x E>E x E repre- 
sent F. Then f,(x, v) is the unique vector such that for all w,éE we 
have: 


<fa(x v), g(x)wi> = 2¢9'(x) Wy “0, VY — Cg’ (x) 0° v, Wy, >. 


From this one sees that f, is homogeneous of degree 2 in the second 
variable v, in other words that it represents a spray. This concludes the 
proof of Theorem 7.1. 


Remark. Having represented fy,.(x,v) in the chart, we could also 
represent the associated bilinear map B,. We shall give the formula for 
By in the context of Theorem 4.2 of Chapter VHI. 


CHAPTER VIII 


Covariant Derivatives and 
Geodesics 


Throughout this chapter, by a manifold, we shall mean a C® mani- 
fold, for simplicity of language. Vector fields, forms and other objects 
will also be assumed to be C® unless otherwise specified. We let X 
be a manifold. We denote the R-vector space of vector fields by 
I'T(X). Observe that T'T(X) is also a module over the ring of func- 
tions & = §°(X). We let 

um: TX > X 


be the natural map of the tangent bundle onto X. 


Vill, §1. BASIC PROPERTIES 


By a covariant derivative D we mean an R-bilinear map 
D: TT(X) x TT(X) — TT(X), 
denoted by (€, 7) +> D,n, satisfying the two conditions: 


COVD 1. (a) In the first variable ¢, D,n is §-linear. 
(b) For a function 9, define D.p = fo = Ye to be the 
Lie derivative of the function. Then in the second variable 
n, Dzn is a derivation. Thus (a) and (b) can be written in 
the form: 


Dzn= oD and Don) =(Dze)n + Dm. 
COVD 2. D.n — D,é = [én]. 
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Remark. This second condition can be eliminated to give rise to a 
more general notion, following the ideas of a connection as described at 
the end of Chapter IV, §3. However, we concentrate here on what we 
need for some basic results, rather than develop systematically the gen- 
eral theory of connections. 


Having defined D, on functions and vector fields, we may extend the 
definition to all differential forms, or even to multilinear tensor fields. 
Let w be in I'L’(T(X)), ie. @ is a multilinear tensor field on X, not 
necessarily alternating. We define D,@ by giving its value on vector fields 
N15 -++5M,y, Namely 


(D-o) (ny prs Mr) me Lon, “¢ -M,)) ~ py a(n, ay -»Dzn;, era 
j= 


The definition of D, is such that D, satisfies the derivation property with 
respect to the r + 1 variables w, n,, ...,n,, that is 


D(o(n, i. -sMy)) = (Dz@)(n;, . My) + ps a(n, weesDeNj, oe “oMy)- 
j= 


Recall that D; = Y, on function, as on the left side of this equation. 
Looking in a local chart shows that D,@ is again a multilinear tensor 
field. It is immediate from the definition that if @ is alternating, then so 
is D,w. In particular, D, is a derivation with respect to contractions and 
it is also a derivation with respect to the wedge product, that is: 


COVD 3. D(a on,) = (Dem) ony + @o Deny. 


COVD 4. On the algebra of alternating forms, the covariant derivative 
D; is a derivation, in the sense that for two forms w and y, 
we have 

D{@ A y)= Da A y + @ A Dzy. 


The proof comes directly from the definition of the wedge product in 
Chapter V, §3. In the finite dimensional case, when a form is a sum of 
decomposable forms, i.e. wedge products of forms of degree 0 and 1, it 
follows that the above definition is the unique extension of D, to the 
algebra of differential forms. Furthermore, similarly to the formula of 
Proposition 5.1 of Chapter V, for the Lie derivative of a form, one has: 


COVD 5. (L.0)(115--.5,) = (Deo) (ny, «--%) + ps O15 «+-sDy,E «+My 


which is an alternative to 


Loony, oe My) = (D:o)(n, ON. My) + Py o(n,, .e Denis +2 My). 
i= 


r 


COVD 6. (da) (Co, fw a8 56) = > (= 1)(Dz,0)(;, oe sspebs Cos Ciats leueGole 


i=0 
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Proof. One uses the formulas given in propositions of Chapter V, 
Proposition 3.2, for dw, and Proposition 5.1 for the Lie derivative. One 
replaces brackets [8,7] by D,y —D,B. The desired formulas drop out. 
Note that only COVD 2 has been used in the proof. 


Next we give a finite dimensional formula. Recall that a frame of 
vector fields €,, ...,€, is such that for each x, €,(x), ...,é,(x) are a basis 
of T,X. 


Proposition 1.1. Let {€,,...,€,} be a frame of vector fields. Let 4,, 
.s+5A, be the dual frame of 1-forms (so 4,(&;) = 6,;). For any form 
we "(X) we have 


do = y A; \ Dz,o. 
i=1 


Proof. Let d’o =) A, A Dz. Then d’ defines an anti-derivation of 
the alternating algebra of forms, that is if wy € ./4(X) for any q, then 


dioaarawW=(doa)vavt+(-Voan dy. 


Furthermore, d'’=d on functions (as is immediately verified), and we 
verify that d’ = d on .1(X) as follows: 
(d'w)(¢, 9) = y (A; A D;,o)(é, n) 

= Y TAG) <Dz,0, > — An) <Dz,@, >] 

= > [<D,,@®, n> — CD, ys é>] 

7 (Dz, n> 2 <D,,@, é> 

= (da)(¢,n) by COVD 6, 
which concludes the proof for 1-forms. Since 1-forms generate the alge- 


bra of forms in the finite dimensional case, the proposition is proved in 
general. 


The above finite dimensional formula won’t be used until we meet 
strictly finite dimensional results, in connection with volume forms and 
integration. We included it here for completeness of the general for- 
malism. We now return to the general case which may be infinite 
dimensional. 


We can extend the covariant derivative to TX-valued forms i.e. 


sections of the bundle L’(TX, TX). If w is such a section, we define Dw 
by its values on vector fields 7,, ...,n, to be 


(Deo) (11, ---M,) = Deol, ---sm)) — z O(n» ---,Denj, --- Mr), 
j= 
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so D, satisfies the derivation property with respect to the r + 1. variables 
@, 41, .--, 4, We note that w(,,...,n,)€ TTX is a vector field, so we 
know how to apply the covariant derivative D(ao(n,, ---5m,)) instead of 
L(o(n;,....9,)) for ordinary R-valued forms, in which case o(m;,.-..,,) 
is a function on X. When o is TX-valued, we have on the other hand 


Lon gates N,)) = [é, on gee n,) 1. 


A local formula will be given in Proposition 2.2. 


Vill, §2. SPRAYS AND COVARIANT DERIVATIVES 


Let F be a spray over a manifold X. In a chart U, we index geometric 
objects by U to indicate their representatives in the chart. Thus the 
representative , of a vector field over U is a morphism 


éy: U > E. 


Similarly, we have the symmetric bilinear map associated with the spray, 
and its representative 


By(x) = 3D35 fu,2(% 0), 


where fy,2 is the second component of the representative for the spray, as 
described in Chapter IV, §3. 


Theorem 2.1. Given a spray F over X, there exists a unique covariant 
derivative D such that in a chart U, the derivative is given by the local 
formula 


(Den)y(x) = nu(™)SulX) — By (x; u(x), ny(x)). 


Or, suppressing the index U for simplicity, and thus using €, n to denote 
the local representatives of the vector fields in the chart, we have 


(Den)(x) = n'(x)E(x) — B(x; E(x), n)) 


Den = 1'-¢ — BOS, n). 


or simply 


Proof. Let us define Dn over U by the formula of the theorem. It is 
immediately verified that Dy is a vector field over U, and that the 
association (€,7)++ D,n is a covariant derivative over U: It is §&(U)- 
linear in the variable €, it is a derivation in the variable n with respect to 
multiplication by functions, and we have 


Den — D,€ = [é, 1]. 
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This last property follows from the representation of the bracket in a 
chart given by Proposition 1.3 of Chapter V. Thus a spray gives rise to 
a covariant derivative in a chart, in a natural fashion. 

We now claim that when the spray is given globally, there exists a 
unique covariant derivative on the manifold X which has the above 
representation in a chart. For this we must verify how the local repre- 
sentation changes under a change of chart. Let 


h:U 3 V 


be a C®-isomorphism, ic. a change of chart. Then we claim that the 
natural image of D.,ny under the change of chart is D,,ny, so that we 
may define D.y for any two vector fields on the manifold via the local 
representations. 

In other words, we have to verify that 


(Dz, ny) (A(x) = h'(x) (Dz,Nv)(x). 


ny (h(x) = h'(x)ny(>), 


But we have 


whence by the rule for the derivative of a product, we obtain 


(nyo hy (x) = h"(x)nu(x) + h')nu). 


Hence putting v = €,(x), w = ny(x), we get by using the change of vari- 
able formula for a spray in a chart, Proposition 3.3 of Chapter IV, 
together with the fact that h”(x) is a symmetric bilinear map: 


(Dz, ny) (h(x) = ny (h(x))h! (x) En) — By (A(x); h’(x)v, h'(x)w) 
= (ny o hy (x)Eu(x) — h" (x), w) — h'(x)By(x; v, w) 
= h"(x)(w, v) + h'(x)ny(x) Sul) 
— h"(x)(v, w) — h’(x) By (x; 0, w) 
=h'(x)(no)Eu(x) — Bul; v, w)) 
(appreciate the cancellation !) 
= h'(x)(Den)u), 


which proves the change of variable formula, and therefore concludes the 
proof of Theorem 2.1. 


The covariant derivative defined in Theorem 2.1 will be called the 
covariant derivative determined by the spray, or associated with the spray. 
As mentioned previously, one could give a similar definition of a covariant 
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derivative associated to any connection (even without the symmetry con- 
dition on the bilinear map). 

There is of course an analogous local representation for differential 
forms as follows. 


Proposition 2.2. Let w ¢€ITL'(TX, R) or TL'(TX, TX). Let €, n,, ...5n, 
be vector fields over X. If weTL'(TX,R), then in a chart U we have 
the formula 


(D:@)y(n1 u>-+oMu) 


= My (Eu) (Miu; -- Mu) + 2 O(n, ---.BulEv, Niu)» «+ +>Mru)» 
- 
If oE€ TL'(TX, TX), then 


(D@)y(N1u>-+-»Nru) = Same expression — By(éu, O(N, -sesMpu))- 


Proof. This comes directly from the definitions in §1. Observe that in 
applying the definitions, the sum 


oy Oy(Niv, : Miu’ a +oMyy) 
j= 


occurs twice, once with a + sign and once with a — sign, so cancels in the 
end. 


For the limited purposes of this book, we will not need the proposi- 
tion. It has an analogue for lifts of curves, which we shall discuss briefly 
at the end of §3. 


Converse, from covariant derivatives to sprays 


We now wish to discuss the converse of Theorem 2.1, and for this 
purpose, we have to make general remarks on localization. Let E be a 
Banach space. We say that E admits cut off functions if given two 
positive real numbers 0 <r < sy, there exists a C®-function (simply called 
function) g such that g = 1 on the ball B,) and @ =0 on the comple- 
ment of B,(0). Given any point x)¢E, we may then find similarly a 
function which is 1 in the ball B(x9) and 0 outside B,(xo). If X is a 
manifold modeled on E, then one can then find such cut off functions 
equal to 1 in a given neighborhood of a point, and 0 outside a slightly 
larger neighborhood. Manifolds modelled on a Hilbert space, and espe- 
cially finite dimensional manifolds, admit cut off functions. 
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Assume that X admits cut off functions. Let E be a vector bundle 
over X, and let € be a section of E. Let x» ¢ X. Let @ a cut off function 
near X 9. Then gé is a section of E, having the same values as & in a 
neighborhood of x». Suppose that E = TX and that D is a covariant 
derivative. Then 


(Dzn)(x) = (Dye) (x) 


for all x in a sufficiently small neighborhood of x,, because D is %-linear 
in the first variable. Since @ is constant near x9, it follows that 


(L.¢)(x) = 0 for x near Xo, 
and it therefore follows also that 


(D-(gn)) (x) = (Den)(x) 


for all x sufficiently close to xo. 

Now given an open neighborhood U, of x corresponding to a chart, 
we pick cut off functions g, w near x, such that the supports of @, w are 
contained in Uy, and g, wy =1 on an open neighborhood U of x whose 
closure is contained in Uy. Then U also corresponds to a chart, and we 
may compute 


(D.n)(x) = (D, (Wn) (x) for: sox Uz 


Thus the determination of the values of a covariant derivative can be 
carried out locally in a chart. We still need a criterion when the value of 
the covariant derivative at a given point depends only on the value of é 
at the given point. 


Lemma 2.3. Let E, F be vector bundles over X, with E finite dimen- 
sional and X admitting cut off functions. Let 


H:TE>TF 


be a linear map which is §(X)-linear, that is H(~é) = @H(é) for oe &. 
Given a point x € X, the value H(&)(x) depends only on the value &(x). 


Proof. It suffices to prove that if €(x9) =0 then H(é)(x9) = 0. There 
exists a cut off function @ near x9 by assumption, so we may give the 
proof locally. By assumption, there exists a finite number of sections e,, 
...,@, of E which form a basis for the sections locally, so there exist 
functions @,, ...,@, such that 


C= Ge, +''' + Ge, 
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locally. Then 
A(¢) = 9,H(e;) ++: + 9, H(e,). 


The condition ¢(x9)=0 is equivalent with the conditions ¢,(x9) = 0 for 
all i. Hence H(€)(x9) = 0, thus proving the lemma. 


Observe that when we obtain a covariant derivative from a spray, the 
value of the covariant derivative at a point x depends only on the value 
of the vector field €(x) (a derivative of yn however enters). This was clear 
from the local formula in Theorem 2.1, because for instance By(x; u, w) is 
defined for arbitrary vectors u, w which can then be taken to be the 
values ¢(x) and yy(x) respectively. 

Conversely, we are now interested in reversing the procedure. Specifi- 
cally, let D be a covariant derivative. We assume the existence of cut off 
functions throughout. In a chart over an open set U in E, define 


(Bu) By(x; &, m) = n'(x)E(x) — (Dz, nu) (>). 


It is immediately verified from the two properties of a covariant deriva- 
tive that By(x) is symmetric in €,, ny by COVD 2, and then By(x) is 
@(U)-bilinear in Ey, ny. Given vectors u, w € E one wants to define 


By(x)(u, w) = By (x; ¢(x), n(x)) 


for any vector fields €, 4 such that (x) =u and n(x)=w. At this point, 
we need to know that the value on the right of (By) is independent of 
the vector fields €, 1 chosen so that é(x)=v and n(x)=w. By Lemma 
2.3 we can certainly achieve this in the finite dimensional case, and in 
that case we obtain: 


Theorem 2.4. Assume X finite dimensional. Then the association of a 
covariant derivative to a spray establishes a bijection between sprays 
over X and covariant derivatives. 


In practice, Theorem 2.4 is not that useful (and it will NOT be used in 
this book) because one either starts from a spray to get a covariant 
derivative, or if one starts from some natural covariant derivative, and 
one needs the spray, the situation provides the tools to show that a 
spray can indeed be defined in a natural manner to give the covariant 
derivative. We shall see an example of this in §4, when we discuss the 
Riemannian covariant derivative. Furthermore, the finite dimensional de- 
vice used in Lemma 2.3 has had historically the unfortunate effect of 
obscuring the natural bilinear map B, thus obscuring a fundamental 
structure in expositions of differential geometry. Quite generally, connec- 
tions on any vector bundle give rise to covariant derivatives. These are 
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applicable to many contexts of topology and analysis, see for example 
[BGV 92], Chapter I, and also for instance [MokSY 93] for an entirely 
different direction. 


Vill, §3. DERIVATIVE ALONG A CURVE 
AND PARALLELISM 


Instead of using vector fields €, 7 we may carry out a similar construc- 
tion of a differentiation dealing only with curves, as follows. 
We continue to denote by F a spray over X. Let x: TX > X be the 
tangent bundle, and let 
a: J > X 


be a C! curve. By a lift y of « to TX we mean a C' curve y: J >TX 
such that my =a. We then also say that y lies above «. We denote the 
set of lifts of « by Lift(a). It is clear that Lift(«) is a vector space over R, 
and a module over the ring of functions on J. We wish to define D,,y in 
a way analogous to the way we defined D,n for vector fields ¢, 7. This is 
done by the next theorem. As in §2, we let By denote the bilinear map 
associated to the spray in a chart U. 


Theorem 3.1. There exists a unique linear map 
D,,: Lift(a) > Lift(a) 
which in a chart U has the expression 
(DyYu(t) = yult) — Bu(a(t)s av(0), ru(0). 
The map D,, satisfies the derivation property for a C' function g on J: 
(D,(9y)) (0) = o'r) + POD) (0. 


Remark. In the present context, the local representation yy of a curve 
in TU = U x E is taken to be the map on the second component, i.e. 


Yu: J > E. 
Thus y;(t) is the ordinary derivative, with values y,(t)eE. Note that in 
the case of the representation ay: JU, we have aj(t)e E also. Thus 


a(t), yy(t) and y(t) are “vectors.” 


Proof of Theorem 3.1. The proof is entirely analogous to the proof for 
Theorem 2.1, using the local representation of the bilinear map By asso- 
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ciated with a spray in charts. We have to verify that the formula of 
Theorem 3.1 transforms in the proper way under a change of charts, i.e. 
under an isomorphism 

h:U>V. 


Note that the local representation y, of the curve by definition is given 
by 
rv(t) = h'(ap(t))yu(0). 


Therefore by the rule for the derivative of a product, we find: 


y(t) = h"(ap(t))(%o(0), yo(t)) + h'(au(0), Yoel). 


Hence using the transformation rule from By to B,, Proposition 3.3 of 
Chapter IV, we get 


(De Vv(t) = yy) — By(a); a (0), yv(0)) 
= h"(ay(t)) (welt), yo) + h' (eu) rc) — bh" (ee) (%0), Po) 
— h'(ay(t)) By (a(t), ay(0), yo(O)) 
= h'(ay(t))(D,y)y(t) (because the h” term cancels), 


which proves the desired transformation formula for (D,y)y in charts. 
Thus we have proved the existence of D,-y as asserted. Its being a 
derivation is immediate from the local representation in charts. This 
concludes the proof of Theorem 3.1. 


Corollary 3.2. Let be a vector field and suppose y(t) = n(a(t)), te J. 
Let & be a vector field on X such that «'(to) = €(a(to)) for some to € J. 
Then 


(Dy¥) (to) = (Dzn)(a(to)). 


Proof. Immediate from the chain rule and the local representation of 
Theorem 3.1. 


Let «: J + X be a C?-morphism. We say that a lift y: J+ TX of « is 
a-parallel if D,,y =0. In the chart U, this is equivalent to the condition 
that 


Volt) = By(ay(t); xO), vod), 


which defines a first-order linear differential equation for yy. From the 
basic properties of linear differential equations, we obtain: 


Theorem 3.3. Let «:J—X be a C? curve in X. Let toe J. Given 
v € Tya,)X, there exists a unique lift y,: J+ TX which is a-parallel and 
such that y,(to) =v. Let Par(a) denote the set of a-parallel lifts of a. 
The map v +> y, is a linear isomorphism of T,y,)X with Par(a). 
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Proof. The existence and uniqueness simply comes from the existence 
and uniqueness of solutions of differential equations. Note that from the 
linearity of the equation, the integral curve y is defined on the whole 
interval of definition J by Proposition 1.9 of Chapter IV. 


Of course, the notion of parallelism is with respect to the given spray, 
which has been left out of the notation. We express the linearity of 
Theorem 3.3 another way in the next theorem. 

Theorem 3.4. Fix tg¢ J. For te J define the map 

Pa tgnt = P;: Tet» X 4 Tan X by P(v) = y(t, 0), 


where tt> y(t, v) is the unique curve in TX which is a-parallel and 
Y(to, v) =v. Then P, is a linear isomorphism. 


Proof. We must verify that P,(sv) = sP,(v) and Pv + w) = Pv) + Pw) 
for se R and v, we T,X. But these properties follow at once from the 
linearity of the differential equation satisfied by y, and the uniqueness 
theorem for its solutions with given initial conditions. 


The map P, is called parallel translation along «. 


Multilinear tensor fields 

Instead of dealing with vector fields, we may deal with TX-valued multi- 

linear tensor fields, or R-valued multilinear tensor fields at essentially no 

extra cost. Let E denote either TX or R. We extend D,, to a linear map 
D,,: Lift(a, L"(TX, E)) > Lift(«, L'(TX, E)) 

as follows. Let w: J L'(TX, E) be a lift of a: J+X. Let n,, ...,n, be 


lifts of « in TX (sometimes called vector fields along the curve «). We 
define D,.« by its values on (y,,...,n,) to be 


(D,©)(n1, pe Nr) = D, (on, si -M,)) ~~ p> a(n, wee Dy Nj, * sralp)s 
j= 


Thus D,, satisfies the Leibniz rule for the derivative of a multifold prod- 
uct with the r+ 1 variables w, y,, ...,n,. Note that if ,, ...,y, are 
a-parallel, so D,.n; = 0, then the formula simplifies to 


(D,,@)(1, geese Nr) = D,(o(ny o8 ee N,))- 


We shail obtain a local formula as usual. Given an index j, we define a 
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linear operator C; ,,, of 'L'(TX, E) into itself by 


(Cj, p,2) (M1 gh ts Nr) = an, gee ,B(a; a’, Nj), ass Nr): 
Proposition 3.5 (Local Expression). Let @ = @y, nj = Njy etc. represent 


the respective objects in a chart U, omitting the subscript U to simplify 
the notation. Then 


(D,,©)(n, one ss Nr) = o'(ny go Ny) +; B(a; a’, o(n, s00 8 1) Og, 7x 
+ 2 ol(n,,...,B(a; a, 1), --- Mp) 
j= 
or also 
D,@ = wo! — Ba; a’, @)dg,rx + ¥ C,p,4, 
fF 
where og, ry = 1 if E= TX and 0if E=R. 


This comes from the definition at the end of §1, and the fact that the 
ordinary derivative 


(@u(tiu, --- Mr)’ 
in the chart is obtained by the Leibniz rule (suppressing the index U) 
(O(n, ---m)Y = O's --sme) + Y Ol «ahs «th: 
Corollary 3.6. Let E= TX or R as above. Let Q: X ~ L"(TX, E) be a 


section (so a tensor field), and let w(t) = Q(a(t)), teJ. Let tye J. Let 
€ be a vector field such that a'(ty) = E(a(to)). Then 


(D,@) (to) = (DQ) (a(to)). 


Proof. Immediate from the chain rule and the local representation 
formula. 


A lift y: J > L'(TX, E) is called «-parallel if D,y = 0. The local expres- 
sion in a chart U shows that the condition D,,y = 0 is locally equivalent 
to the condition 


y = Ba’, y) -— py Ci,3,0?- 
- 


Of course, we have suppressed the subscript U from the notation. Thus 
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the condition of being a-parallel defines locally an ordinary linear differ- 
ential equation, and we obtain from the standard existence and unique- 
ness theorems: 


Theorem 3.7. Let tye J and wy) €TL(Ta,)X; Exa,))- There exists a 
unique curve y: J > L'(TX, E) which is «-parallel and such that y(to) = 
@ . Denote this curve by y,,._ The map 


My > Yao 


establishes a linear isomorphism between the Banach space L'(Ty.)X, Extto)) 
and the space of lifts Lift(«, L'(TX, E)). 


We have now reached a point where we have the parallelism analo- 
gous to the simplest case of the tangent bundle as in Theorem 3.4. 


Theorem 3.8. Let the notation be as in Theorem 3.7. For te J define 
the map 


Py to,t = Pye: L (Ty) X> Exit) =P L'(TyyX; Eww) 
by 
P,,(@o) = y(t, Mo), 


where t +> y(t, @o) is the unique a-parallel lift of « with y(0, ao) = ao. 
Then P,,, is a linear isomorphism. 


Proof. This follows at once from the linearity of the differential equa- 
tion satisfied by y, and the uniqueness theorem for its solutions with 
given initial conditions. 

Example. The metric g itself is a symmetric bilinear R-valued tensor 
to which the above results can be applied. 

Vill, §4. THE METRIC DERIVATIVE 
Let (X, g) be a pseudo Riemannian manifold. Let 
Xv, Wy, = <0, W a(x) 
denote the scalar product on the tangent bundle, with v, we T, for some 
x. If €, 4 are vector fields, then <é,), is a function on X, whose value 


at a point x is 


<E(x), n (x) Ys = <E(x), y (x) a(x) * 
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If € is a vector field, we denote 


C<é, ™g = D6, ™g = LX, Dg 


Theorem 4.1. Let (X, g) be a pseudo Riemannian manifold. There exists 
a unique covariant derivative D such that for all vector fields €, n, € we 
have 


MD 1. DXn, Oy = Den, 64 + <n, Deo ,- 


This covariant derivative is called the pseudo Riemannian derivative, or 
metric derivative, or Levi-Civita derivative. 


Proof. For the uniqueness, we shall express (Dz, (>, entirely in terms 
of operations which do not involve the derivative D. To do this, we 
write down the first defining property of a connection for a cyclic permu- 
tation of the three variables: 


o<n, Or = «Den, os + <n, Do, 
NXE, Og = <Dyls Sg + <6, Did dg 
c<é, "sg = <DzE, "4 + <é, D,n>,- 


We add the first two relations and subtract the third. Using the second 
defining property of a covariant derivative, the following property drops 
out: 


MD 2. 2<Dzn, Cd, = SM Og + NXE; Og — CXS M9, 
+ CEG. n), Gry ~~ <[¢, C1, ™,g an <n, C1, Sy: 


This proves the uniqueness. 


As to existence, define <Dzn,¢>, to be 4 of the right side of MD 2. 
If we view ¢, 4 as fixed, and ¢ as variable, then this right side can be 
checked in a chart to give a continuous linear functional on vector fields. 
By Proposition 6.1 of Chapter V, such a functional can be represented by 
a vector, and this vector defines D,y at each point of the manifold. Thus 
D.y is itself a vector field. Using the basic property of the bracket 
product with a function 9: 


[¢,on]=o[E.n]+(o)n and = [oéo,n]=o[6,n]—(M@e 


it is routinely verified that (D.n, ¢>, is §-linear in its first variable ¢, and 
also %-linear in the third variable ¢. One also verifies routinely that 
COVD 2 is also satisfied, whence existence follows and the theorem is 
proved. 
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Recall that we defined D,@ for any multilinear tensor w. In particular, 
let w@ = g be the metric. Then the defining property of the metric connec- 
tion can now be phrased by stating that for all vector fields ¢, 


Deg = 0. 


For each vector field 7 let \V, or n” be the 1-form corresponding to y 
under the metric, i.e. for all vector fields ¢, (V,n)(f) = <n, Oy. 


Corollary. For the metric derivative D and all vector fields €, we have 
the commutation rule 


Dzo\V/, = V0 Dz or D,(n’) = (Den)”. 
Proof. One line: 


(V, (Den) (6) = <Den, Og = DeXn, 64 — <n, Deg = DAV n)(C).- 


Local representation of the metric derivative 


From MD 2, we derive a local formula in a chart U. In the next 
formula, we write €, n, ¢: U +E for the representatives of vector fields in 
the chart, instead of the correct Ey, ny, Cy. Omitting the index U simpli- 
fies the notation when U is fixed throughout the discussion. Here 


g: U > L(E, E) 


denotes the operator defining the metric relative to the given non-singu- 
lar form on E, so that 


<6, Mg = (95, n> = <6, gn). 


Observe that in COVD 2 and MD 2, we took the scalar product in 
the tangent space, but in the next formula, the scalar product < , > 
without an index is the one given by our original non-singular symmetric 
bilinear form on E. 


MD 3. Locally in a chart U, the metric derivative is determined by 
the formula: 


2<Den, 90> 
= 2g o> + Km Gg SOO + gn ld — gi om. 


Proof. We apply MD 2. We express a g-scalar product in terms of 
the standard scalar product, and we use the local representations of the 
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Lie derivative and the bracket from Chapter IV, Proposition 1.1 and 
Proposition 1.3. For instance, we have the local representation 
SX Og = <M gOS 
= <8, gl> + <n g'o-0> + <n, gl > 
by using the rule for the derivative of a product. This formula is meant 


to be evaluated at each point x. Note that g’(x):E x E-E is a bilinear 
map, which is such that, for instance: 


g(x) E(x) C(x) = g(x) (E(x), (x)). 


One can work formally without putting the (x) in the notation. Similarly, 


Cle. a oS eg 
= (gl, -E—E-n> 
= (gf, n'°C> — gl, Om). 


Thus we can transform each term appearing on the right of MD 2. 
Then all the terms involving g (rather than g’) will cancel except two of 
them which are equal, and add to yield 2<gl,n’-é>. The remaining 
terms are those which are shown on the right side of MD 3. This 
concludes the proof. 


Remark. Let us denote by D% the covariant derivative associated with 
the metric g. Let ce R*. Then cg is also a metric, called a scaling of g, 
and it follows immediately from MD 3 that 


D9 = D4, 
ie. the covariant derivative is invariant under a scaling of the metric. 


Observe that the definition of the metric derivative in Theorem 4.1 
is given by a formula, namely MD 2, with its local representation 
MD 3. We want to see that the metric derivative is the one asso- 
ciated with a spray. We recall that quadratic maps and symmetric bilin- 
ear maps correspond to each other via the formulas 


Q(v) = Biv, v) = and =— Biv, w) = 3[O(v + w) — Q(v) — Q(w)]. 
The next theorem summarizes the situation. 


Theorem 4.2. Let (X, g) be a pseudo Riemannian manifold. There exists 
a unique spray on X satisfying the following two equivalent conditions. 
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MS 1. In a chart U, the associated bilinear map By satisfies the follow- 
ing formula for all v, w, z€E: 


—2¢By(x; v, w), g(x)2> = 
<g'(x)- 0-2, w> + Cg'(x)-w-z, vy — Cg"(x) 2° w, DD. 


Thus if we let fy,2(x, 0) = By(x; v, v) and fo(x, v) = (0, fu,2(% 0), 
then fy represents the spray on TU =U x E. 


MS 2. The covariant derivative associated to the spray is the metric 
derivative satisfying Theorem 4.1. 


This spray is the same as the canonical spray of Chapter VII, Theorem 
7.1. 


Proof. First observe that By as defined by the formula is symmetric in 
(v, w). The symmetry is built in the sum of the first two terms, and to see 
that the third term is symmetric, one differentiates with respect to x the 
formula 


<g(x)z, v> = <g(x)v, 2), 


which merely expresses the symmetry of g(x) itself. Thus we may form 
the quadratic map fy,>(x, v) = By(x; v, v) from the symmetric bilinear map 
By(x; v, w). It follows that fy as defined represents a spray Fy over TU. 
At this point, one may argue in two ways to globalize. 

Comparing MD 3 with MS 1 we see that the covariant derivative on 
U determined by the spray Fy is precisely the metric derivative. Theorem 
2.1 shows that if two sprays determine the same covariant derivative on 
U then they are equal. If U, V are two charts, then fy and fy are the 
local representatives of sprays Fy and F, on U and V respectively, which 
must therefore coincide on UV. Hence the family {F,} defines a spray 
F on X. Once again, Theorem 2.1 and MD 3 show that covariant 
derivative determined by F is the metric derivative. 

Furthermore, if we substitute v = w (and z = w,) in the chart formula 
of MS 1, thus giving the quadratic expression fy (x, v), then one sees 
that this expression coincides with the chart expression of Proposition 7.2 
of Chapter VII, and hence that the spray obtained in a natural way from 
the metric derivative is equal to the canonical spray of Chapter VII, 
Theorem 7.1. 

Another possibility is to admit Theorems 7.1 and 7.2 of Chapter VII, 
which already proved the existence of a spray whose quadratic map 
fu.2 is obtained from the symmetric bilinear map By as defined in MS 1. 
This gives immediately the existence of a unique spray on X having the 
representation of MS 1 in a chart U, and this spray is the canonical 
spray. That MS 2 is equivalent to MS 1 then follows from MD 3. This 
concludes the proof. 
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The spray of Theorem 4.2 will be called the metric spray. Since it is 
equal to the canonical spray, we really don’t need two names for it. 


Remark. To connect with other texts, note that in terms of local 
coordinates, the metric spray is given by a map f, satisfying the second 
order differential equation 


dx; 
dt” 


d?x; 
de = fr(x, v) and B= 


As a function of the variable v, the map f is quadratic, and minus its 
coefficients are functions of x, called the Christoffel symbols, [j,. Thus by 
definition, the above differential equation is of type 


In terms of the standard basis for R", the metric is represented by a 
matrix 


(g(x), 


and we let (g‘/) be the inverse matrix. Then the formula of Theorem 4.2 
can be written in terms of the local coordinates in terms of the Christoffel 
symbols, namely 


; 1 yOu , iv Giz 
J = yv _ 
Th 2 yy @ ( Ox; > Ox, Ox, 


If I gave priority to fit classical notation, I would have written —Iy 
instead of By for the bilinear map associated with the spray. However, 
using the letter B suggests bilinearity, whereas using the letter I would 
suggest the above mess. Besides, using B is more natural for the bilinear 
map associated to the quadratic map of the second order differential 
equation, and eliminates a minus sign from that equation. 


Theorem 4.3. Let «: J — X be a C? curve in a Riemannian manifold 
(X, g). For the metric derivative, and curves y, ¢ € Lift(«, TX), we have 
the formula 


‘Y 5 sz <Dys >; + Y Dy og: 


Furthermore, parallel translation is a metric isomorphism. In particular, 
let toe J. If yy, Yy are the unique a-parallel lifts of « with y,(to) =v 
and y,,(to) = w, then for all t, 


Volt), Ywlt)g = <P Wa. 
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Proof. The formula is proved in the same way that the computation 
proving Theorem 3.1 was parallel to the computation proving Theorem 
2.1 (giving the behavior under changes of charts). From the formula, if 
Dyy = DyC =0, it follows that <y,¢>, is constant, whence the second 


assertion follows. 


Corollary 4.4. Let @ be a C? function on X. Let « be a geodesic for 
the metric spray. Then 


(p oa)" = ((Dy grad @) 0 a, a’ >,. 


Proof. Taking the first derivative of ~ o a yields 


(g 0a) (t) = (dg) (a(t))a’(t) = <(grad ¢)(a(2)), #'(0) >, 


Now take the next derivative using Theorem 4.3 and the fact that D,.«’ 
= 0. The desired formula drops out. 


Vill, §5. MORE LOCAL RESULTS ON 
THE EXPONENTIAL MAP 


In this section, we give further results on the exponential map obtained 
from a spray. We follow the same notation as in Chapter IV, §4, and at 
first we just deal with a spray. We do not need to know whether it 
comes from a metric or not. 


Throughout the section, we let X be a manifold with a spray F. 


Instead of looking at the exponential map restricted to the tangent 
space at a given point, we may consider this map in the neighborhood of 
a point in the whole tangent bundle. Let 2: TX > X be the projection as 
always. Let x9 ¢X, with zero element 0,,¢ T,X. There exists an open 
neighborhood V of 0,, in TX on which we can define the map 


G:V>XxX — suchthat — G(v) = (av, exp,,(v)). 


It is sometimes useful to express this map in a different notation. Specifi- 
cally, if we denote a point in the tangent bundle by a pair (x, v) if 
ve T,X, then 


G(x, v) = (x, exp,(v)). 


Using a pair (x, v) is certainly the way we would write a point in the 
tangent bundle as represented in a chart U x E, with xe U and ve E. 
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Proposition 5.1. The map G is a local isomorphism at (xo, 0). 


Proof. The Jacobian matrix of G in a chart is given immediately from 
Chapter IV, Theorem 4.1 by 
id id 
0 id 


which is invertible. The inverse mapping theorem concludes the proof. 


For the next local results, it is convenient to express certain uni- 
formities in a chart, where we can measure distances uniformly in the 
model Banach space E, with a given norm. It is irrelevant to know 
whether this norm has any smoothness properties or not. It will be used 
just to describe neighborhoods of a vector 0 in the tangent bundle. I 
found [Mi 63] useful. 

Let x» ¢ X. For e>0, we let E(e) denote the open ball of elements 
veéE with |v|<«. Arbitrarily small open neighborhoods of (x9, 0) in a 
chart for TX are of the form 


Uy x E(e), 
where U, is an open neighborhood of x, in X, and ¢ is arbitrarily small. 


Corollary 5.2. Given xy eX. Let V be an open neighborhood of (Xo, 0) 
in TX such that G induces an isomorphism of V with its image, and in a 
chart, for some ¢ > 0, 


V = Up x Elo). 


Let W be a neighborhood of Xx, in X such that G(V) > W x W. Then: 


(1) Any two points x, y € W are joined by a unique geodesic in X lying 
in Up, and this geodesic depends C® on the pair (x, y). In other 
words, if t +> exp,(tv) (0 <t < 1) is the geodesic joining x and y, 
with y = exp,(v), then the correspondence 


(x, v) (x, y) 
is C®. 
(2) For each xe W the exponential exp, maps the open set in T,X 
represented by (x, E(e)) isomorphically onto an open set U(x) con- 
taining W. 


Proof. The properties are merely an application of the definitions and 
Proposition 5.1. 
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The pair (V, W) will be said to constitute a normal neighborhood of x, 
in X. Dealing with the pair rather than a single neighborhood is slightly 
inelegant, but to eliminate one of the neighborhoods requires a little 
more work, which most of the time is not necessary. It has to do with 
“convexity” properties, and a theorem of Whitehead [Wh 32]. We shall 
do the work at the end of this section for the Riemannian case. 

In the Riemannian case, given x € X, by a normal chart at x we mean 
an open ball B,(x, c) such that the exponential map 


exp,: B,(0,, c) > B,(x, c) 


is an isomorphism. 

We shall need a lemma which gives us the analogue of the commuta- 
tion rule of partial derivatives in the context of covariant derivatives. Let 
J,, J, be open intervals, and let 


o: J, x Jz > X 
be a C? map. For each fixed te J, we obtain a curve o,: J, X such 
that o,(r) = o(r, t). We can then take the ordinary partial derivative 


(eles 
10(0, 1) = of(r) =<. 


Similarly, we can define 0,0(r, t) = d0/ét. Observe that for each t, the 
curves r++ 0,0(r, t) and r+» 0,0(r, t) are lifts of rt o(r, t) in TX. 

More generally, let Q be a lift of o in TX. Then one may apply the 
covariant derivative with respect to functions of the first variable r, with 
the various notation 


0,20) = 0,0)(r,9 = P2. 
r 
Similarly, we have D,Q(r, ¢). 


Lemma 5.3. We have the rules on lifts of o to TX: 

(a) D,6, = D,0,; and 

(b) 0,<0,6, 0,0), = 2¢D, 624, 0,0),. 

Proof. Let oy represent o in a chart. Then from Theorem 3.1, 
D, 0,6y = 0,0,0y — By(Gy; 6, 0y, 0,6y). 


Since By is symmetric in the last two arguments, this proves (a). As to 
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(b), we use the metric derivative to yield 
02¢6,6, 0,0), = 2¢D, 0,6, 0,9>,, 


and we use (a) to permute the partials variables on the right, to conclude 
the proof of (b), and therefore the proof of the lemma. 


Let now (X, g) be a pseudo Riemannian manifold. For each x e X we 
have the scalar product <v,w), = (v, Ww), for v, we T,X. Let c>0. 
The equation 


(0, DY = C7 


defines a submanifold in T,X, which may be empty. If the metric is 
Riemannian, the equation defines what we call a sphere. In the case 
when the metric is pseudo Riemannian, say indefinite in the finite dimen- 
sional case, then one thinks of the equation as defining something like a 
hyperboloid in the vector space T,X. We can still define the level “hyper- 
surface” S,(c) to be the set of solutions of the above equation. Even in 
infinite dimension, we can say that the codimension of this hypersurface 
is 1. Note that 


S,(re) = rS,(c) for r>0. 


In a neighborhood of the origin 0, in T,X, the exponential map is 
defined, and gives an isomorphism which may be restricted to S,(c) inter- 
sected with this neighborhood. The image of this intersection is then a 
submanifold of a neighborhood of x in X. We look at the geodesics 
starting at x. 


Theorem 5.4. Let t +> u(t) be a curve in S,(1). Let 0 <r <b where b is 
such that the points ru(t) are in the domain of the exponential exp,. 
Define 
o(r, t)=exp,(ru(t)) for O<rs<b. 
Then 
(0,0, 010), = <u, u), = 1. 


Proof. This is immediate since parallel translation is an isometry by 
Theorem 4.3. 


Corollary 5.5. Assume (X,g) Riemannian. Let v € T,X. Suppose |\v||, =r, 
with r >0. Also suppose the segment {tv} (0 St S 1) is contained in the 
domain of the exponential. Let «(t) = exp,(tv). Then L(«) = r. 


Proof. Special case of the length formula in Theorem 5.4, followed by 
an integration to get the length. 
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Remark. The corollary is also valid in the pseudo Riemannian case, if 
one assume that v? = r? > 0, so the notion of length makes sense for the 
curve t +> exp,(tv). 


The next theorem expresses the fact that locally near x, the geodesics 
are orthogonal to the images of the level sets S,(c) under the exponential 
map. 


Theorem 5.6. Let (X,g) be pseudo Riemannian. Let x) €X and let W 
be a small open neighborhood of xo, selected as in Corollary 5.2, with ¢ 
sufficiently small. Let xeW. Then the geodesics through x are or- 
thogonal to the image of S,(c) under exp,, for c sufficiently small 
positive. 


Proof. For « sufficiently small positive, the exponential map is defined 
on S,(r) for 0<r<e, and as we have seen, the level sets S,(r) are 


submanifolds of X. Then our assertion amounts to proving that for 
every curve u: J +S,(1) and 0 <r <c, if we define 


a(r, t) = exp(ru(t)), 


then the two curves 
tr exp,(rou(t)) and r+ exp,(ru(to)) 


are orthogonal for any given value (rg, t 9), which amounts to proving 
that 
<0, 0, 0,0, = 0. 


Let D be the metric derivative. Then 


0,<0,6, 020), = (D, 0,6, 020), + <d,0, D, 0,0), 
= 0, 


because first, for a geodesic «, we know that the metric derivative has the 
property that D,.«’ = 0, so 


D,0,0 =0; 


and second, we use Lemma 5.3 (b) and Theorem 5.4 so the derivative of 
the constant 1 is equal to 0. Therefore the function 


rho <O,a(r, t), d,0(r, t)>, 
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is constant as a function of r. But for r = 0, we have 


o(0, t) = exp,(0) = x, independent of t. 
Hence 
0,0(0, t) = 0. 
It follows finally that 


0,6, 020), = 0 
thus concluding the proof of the theorem. 


In the Riemannian case, the theorem is known as Gauss’ lemma. 
Helgason [He 61] showed in the analytic case that it is valid in the 
pseudo Riemannian case as well. I followed the proof given in [Mi 63], 
which I found applicable to the present context without coordinates, and 
without assuming analyticity. 


Convexity 


We conclude this section with the more systematic study of convexity, 

which was bypassed in Corollary 5.2. We shall treat the Riemannian 

case, which is slightly simpler. So we assume that (X, g) is Riemannian. 
We need to know: 


Given x € X, there exists c > 0 such that if 0<r<_c, then the geodesic 
a such that a(t) =exp,(tv), with O<t<1, and |lvl|,=r, is the shortest 
piecewise C1 path between x and exp,(v). 


This will be proved in Theorems 6.2 and 6.4 of the next section. In 
particular, dist, (x, exp,(v)) =r for r sufficiently small. As usual, we let: 


B,(0,., 7) = open ball in T,X centered at 0,, of radius r; 
B,(x, r) = open ball in X centered at x, of radius r; 
S,(0,, 7) = sphere of radius r in T,X, centered at 0,; and 


S,(x, r) = sphere of radius r in X, centered at x. 


Here we shall deal only with r sufficiently small. 

We define an open set U of X to be convex if given x, ye U there 
exists a unique geodesic in U joining x to y, and such that the length of 
the geodesic is dist,(x, y). We shall prove Whitehead’s theorem [Wh 32] 
in the form: 


Theorem 5.7. Let (X, g) be a Riemannian manifold. Given x € X, there 
exists c >0 such that for all r with 0<r<c the open neighborhood 
B,(x, r) = exp, B,(0,, r) is convex. 


Proof. We need a lemma. 
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Lemma 5.8. Given x € X, there exists c >0 such that if r<_c, and if « 
is a geodesic in X, tangent to S,(x,r) at y = a(to), then a(t) lies outside 
S,(x, r) for t # ty in some neighborhood of to. 


Proof. We pick c such that the exponential map exp, is a differential 
isomorphism on B,(0,,1r) for all r<c and preserves distances on rays 
from 0, to ve T,X with |lv||,=r. Without loss of generality, we can 
suppose ty = 0, so «(0) = y. We shall view y as variable, so we index « 
by y. Also we have to look at the other initial condition «’(0) = ue T,Y, 
so we write a, ,, for the geodesic. Now let 


Nyult) =expy' at) and — fy, (t) = Ny, u(6)?- 


Then 7, ,, is a curve in the fixed Hilbert space T,X, so 


fy ult) = 2<1}, ult), Ny, u(t) ater 
Fill) = 21}, u(t)? + 2<N5,u(t), Ny, ull) Pac) 


Let h(y, u) = f,',(0). Then h(x, u) = 2u?, so h, as a function on T,X is 
positive definite. Therefore there exists c > 0 such that for 0<r<c and 
lyll,=r the function h, is positive definite on TY, and in particular 
h(y,u)>0 for u2 #0. Under the assumption that «,, is tangent to 
S,(x, r) at y, we must have 


f0)=0 and —fy’,(0) = Ay, u) > 0, 
whence for sufficiently small |t|, we get 


Fyult) > fy,u(0) = (expz* ,4(0))? = (expy*(u))? = 77, 
which proves the lemma. 


We can now conclude the proof of Theorem 5.7. Using Corollary 5.2, 
we can find c, > 0 such that putting W = B,(x, c,) satisfies the condition 
of Corollary 5.2. Let c<c,. We show that r<c implies B,(x,r) is 
convex. Let y, zé B,(x,r). Then by that corollary, there exists a unique 
geodesic « in the neighborhood V of x joining y and z. As in the lemma, 
let 

f(t) = (expy' a(t))?, swith astsb. 


It now suffices to prove that f(t) <r’. Suppose f(t) =r? for some t, and 
let to € [a,b] be the maximum of f on this interval, so f(tg) 2r?. Then 
to #a, b so f'(to) =0, whence a is tangent to the sphere S,(x, 7) where 
ro = f(to)'?. The lemma now gives a contradiction, which concludes the 
proof of Theorem 5.7. 
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Remark. In the pseudo Riemannian case, with metric g, one has to 
use an auxiliary Riemannian metric h to apply a similar argument, which 
makes the proof slightly longer. 


Vill, §6. RIEMANNIAN GEODESIC LENGTH 
AND COMPLETENESS 


Throughout this section, we let (X, g) be a Riemannian manifold. 


We return to the Riemannian case, where we use the positive definiteness 
of the metric. In Chapter VII, §6 we defined the length of a piecewise C* 
path. We want to compare the length locally with the length of straight 
lines in the tangent space at a point, under the exponential map. In the 
process, we shall see that locally, a geodesic is the shortest path between 
two points. 


Thus let x) € X and let (V, W) be a normal neighborhood as in Corol- 
lary 5.2. Let xe W. For each piecewise C’ path 


y: [a, b] > U(x) — {x}, 


with U(x) being as in Corollary 5.2(2), we can use the fact that the 
exponential map is invertible and so there exists a unique curve t +> u(t) 
in TM such that ||u(t)|, = 1 and 


y(t) = exp,(r(t)u(t)) with O<r(t)<e. 
In a chart, the vector r(t)u(t) is obtained by the inverse of the exponential 
map followed by a projection, so in particular, the functions t +> r(t) and 
t+» u(t) on [a,b] are piecewise C1. We call these functions the local 


polar coordinates for y. 


Lemma 6.1. For a piecewise C’ curve y: [a,b] > U(x) — {x} as above, 
we have the inequality 


L(y) 2 Ir(b) — r(a)l. 


Equality holds only if the function t ++ r(t) is monotone and the map 
t +» u(t) is constant. 


Proof. Let o(r, t) = exp,(ru(t)). Then y(t) = o(r(t), t). We have 


vy _4y _ Oo, do 
MOS ap ae 
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By the Gauss Lemma Theorem 5.6, we know that do0/dr and do/dt are 
orthogonal. Since ||é0/dr||, = 1 by Lemma 5.4, it follows that 


6 2 
wos =wor + |Z] zion, 
g 


with equality holding only if d0/dt = 0, or equivalently, du/dt = 0. Hence 


b 


b 
Lo)= | by e@lyde& [WC dez In) — rok 


a 


and equality holds only if t > r(f) is monotone and t +> u(t) is constant. 
This completes the proof. 


Theorem 6.2. Let (V, W) constitute a normal neighborhood of a point 
x 9 €X. Let a: [0,1] —V be the geodesic (up to reparametrization) in V 
joining two points of W (namely «(0) and a(1)). Let y: [0,1] > X be 
any other piecewise C' path in X joining these two points. Then 


L(«) S L(y). 


If equality holds, then the polar component t +> v(t) for y is constant, 
the function t +> r(t) is monotone, and a reparametrization of y is equal 
to a. 


Proof. Let x, ye W and let y = exp,(ru) with O<r<e, and |u|, = 1. 
Then for 6>0 and 0<6 <r the path y contains a segment joining the 
shell Sh,(x, 5) with the shell Sh,(x,r) and lying between the two shells. 
By Lemma 6.1, the length of this segment is = r — 6. Letting 6 tend to 0 
shows that L(y) =r. The same lemma proves the conditions on the polar 
functions as asserted. 


Corollary 6.3. Let «:[0,1]— X be a piecewise C* path, parametrized 
by arc length. If L(a) < L(y) for all paths from «(0) to «(1) in X, then 
a is a geodesic. 


Proof. We can find a partition of [0,1] such that the image under « 
of each small interval in the partition is contained in some neighborhood 
W as in the theorem, and its length is small so the image of the segment 
is contained in a normal neighborhood. By Theorem 6.2, the path re- 
stricted to this segment must be a geodesic. Hence the entire path is a 
geodesic, as was to be shown. 


Let «: [a,b] + X be a geodesic. We say that « is a minimal geodesic if 
L(«) < L(y) for every path y joining «(a) and a(b) in X. Theorem 6.2 
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gives us the existence of minimal geodesics locally. We can then formu- 
late another application. Let xe X. Let dist, be the Riemannian dis- 
tance. Let: 

B,(0,, 7), S,(0,, 7), B,(x, r), S,(x, r) 


be the open balls and spheres of radius r, centered at 0, in T,X and at x 
in X, respectively. We now know enough to show that S,(x,r) is the 
image of S,(0,,r) under the exponential map, and similarly for the open 
ball, for sufficiently small r. 


Theorem 6.4. Let (X,g) be a Riemannian manifold and let x ¢ X. There 
exists c > 0 such that for all r<c the map exp, is defined on B,(0,, c), 
gives a differential isomorphism 


exp,: B,(0,., r) > B,(x, r) for allr with O<r<e, 
and also a differential isomorphism 
exp,: S,(0,., r) > S,(x, r) for O0<r<e. 
Proof. Immediate from Corollary 5.5 and Theorem 6.2. 


Next we consider completeness. Since X is a metric space (in the 
ordinary sense), with respect to the distance dist,, the notion of X being 
complete is standard: every Cauchy sequence for dist, converges. On the 
other hand, we can now define another notion of completeness. 

We say that (X, g) is geodesically complete if and only if the maximal 
interval of definition of every geodesic in X is all of R. Alternatively, we 
could say that for each point x € X, the exponential map exp, is defined 
on all of T., because under one normalization of the parametrization of a 
geodesic, it is simply the curve t +> exp,(tv) for some ve T,X. To be 
systematic, let us consider the following conditions: 


COM 1. As a metric space under dist,, X is complete. 
COM 2. All geodesics in X are defined on R. 


COM 3. For every x eX, the exponential exp, is defined on all of 
T,X. 


COM 4. For some x € X, the exponential exp, is defined on all of T,X. 


Proposition 6.5. Each condition implies the next, i.e. 


COM 1=>COM 2=>COM 3=>COM 4. 
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Proof. Assume COM 1. Let a: J— X be a geodesic parametrized by 
arc length on some interval, and take J to be maximal in R. By the 
existence and uniqueness theorem for differential equations, J is open in 
R, and it will suffice to prove that J is closed, or in other words, that J 
contains its end points. For t,, t,¢J we have 


dist(a(t,), a(tz)) S$ |tz — til. 


Suppose for instance that J is bounded above, and let {t,} be a sequence 
in J converging to the right end point of J. Then the sequence {a(t,)} is 
Cauchy by the above inequality, so {a(t,)} converges to a point xX» by 
COM 1. Then for all n sufficiently large, a(t,)-lies in a small normal 
neighborhood of x9, and there is some ¢ > 0, independent of n, such that 
the geodesic can be extended to an interval of length at least ¢ beyond t,, 
thus contradicting the maximality of J, and proving COM 2. The subse- 
quent implications are trivial, so the proposition is proved. 


We are now interested when geodesic completeness implies complete- 
ness. We shall give two criteria for this. One of them is that the mani- 
fold has finite dimension, and the other one will be important for its 
application to conditions on curvature in Chapter IX. The finite dimen- 
sional case depends on the next result. 


Theorem 6.6 (Hopf—Rinow). Assume that (X,g) is connected geode- 
sically complete, and finite dimensional. Then any two points in X can 
be joined by a minimal geodesic. 


Proof. I follow here the variation of the proof given in [Mi 63]. 
Let p, y be two points with p#y. Let W be a normal neighborhood 
of p containing the image of a small ball under the exponential map 
exp,. Let r=dist(p, y), and let 6 be small <r. Then the shell 
Sh,(p, 6) = Sh(p, 5) is contained in W. Since Sh(p, 6) is the image of 
the sphere of radius 6 in T,X, it follows that Sh(p, 6) is compact. Hence 
there exists a point x, on Sh(p, 6) which is at minimal g-distance from y, 
that is 


dist(x 9, y) S dist(x, y) for all x € Sh(p, 6). 


We can write x9 = exp,(du) for some ue T, with |lul|,=1. Let a(t)= 
exp,(tu). We shall prove that exp,(ru) = y. We prove this by “continu- 
ous induction” on t, as it were. More precisely, we shall prove: 


(dist,) We have dist(a(t) y)=r—t for d<Str. 
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Taking t =r will prove the theorem. First we note that (dists) is true. 
Indeed, every path from p to y intersects the shell Sh(p, 6), so 


(1) dist(p, y) = min (dist(p, x) + dist(x, y)) for x €Sh(p, 6) 
= 6 + min dist(x, y) 
= 6 + dist(xo, y), 


so (dist;) is true. Now “inductively”, assume that (dist,) is true for all 
t<r’, with 6<r'<r. Let r, be the least upper bound of such r’. Since 
the distance dist, is continuous, it follows that (dist, is true, and it 
suffices to prove that r, =r. Suppose r; <r. Pick 6, small so we get as 
usual a spherical shell Sh(«(r,), 5,) around «(r,), contained in a normal 
neighborhood of a(r,). As in (1), there is a point x, on Sh(«(r,), 6,) at 
minimal distance from y, and we have the relation as in (1), namely 


dist(a(r,), y) = 6, + dist(x,, y). 

Since (dist, ) is true, we find 
(2) dist(x,, y)=r—r, —6,. 
We claim that x, = a(r, + 6,). To see this, first observe that 

dist(p, x) 2 dist(p, y) — dist(x,, y)=r, + 4,. 
But the path consisting of the two minimal geodesics from p = «(0) to 
a(r,) and from a(r,) to x, has length r, + 6,, so this path (which is 
viewed as a broken geodesic) has minimal length, so it is an unbroken 
geodesic by Corollary 6.3. Hence the path is actually equal to a, so 
a(r, + 6,) = x,, and therefore 


dist(x,, y) Ss (r, ar 61), 


so (dist,,,5,) is true, thus concluding the proof of the continuous induc- 
tion, and also concluding the proof the Hopf—Rinow theorem. 


Corollary 6.7. In the finite dimensional case the four completeness con- 
ditions COM 1 through COM 4 are equivalent to a fifth: 
COM 5. A closed dist,-bounded subset of X is compact. 


Proof. Assume COM 4. Let S be closed and bounded in X, and let 
X 9 €S. Let b be a bound for the diameter of S. Then by Theorem 6.6 
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(Hopf—Rinow), every point of S can be joined to x9 by a geodesic of 
length < b, so S is contained in the image under exp,, of the closed ball 
of radius b in T,,X, so S is compact, thus proving COM 5. 

Assume COM 5. Let {x,} be a Cauchy sequence in X. Then {x,} lies 
in a bounded set, whose closure is compact by assumption, so {x,} has a 
point of accumulation which is actually a limit in X. This proves COM 1, 
and concludes the proof of the corollary. 


Remark. In his thesis [McA 65], McAlpin gave the following example 
which shows a divergence of behavior in the case of infinite dimensional 
Hilbert manifolds. Let E be a Hilbert space with orthonormal basis {e,} 
(n= 0). Let T:E—>E be the linear map such that for a vector v= 
Y Xen € E 


(Sx) = ante 


where dy = 1 anda, =1+4 1/2" forn 21. Then 
llvl| S | Tol] S Fol, 


and therefore T is invertible in Laut(E). Let S be the unit sphere in E 
and let X = T(S), so X is a submanifold of E, to which we give the 
induced metric. Let « be a path joining eg to —ey in S. Then To is a 
path joining eg to —ey in X, and T is length increasing, that is 


L(a) S L(Ta). 
Hence the length of any path in X joining eg to —eg is 2 2, which is 
the minimal length of paths between the two points in E. However, let 


a, be the half great circle joining the two points in the upper (ép, e,)-half 
plane. Then 


L(To,) <(1 + 1/2")n > 2 = disty(e), —eo). 


Hence there is no minimal path joining the two points in X. Note that 
each Tu, is a geodesic in X joining the two points, because 


Im(Ta,,)u — Im(Ta,) 


is the fixed point set of the isometry F, defined by 


a(S ni) = X9ly + Xnln — oF Xx: 
#O0,n 


McAlpin refers to [Gros 64] for results on the distribution of degenerate 
points of the exponential map in similar examples. 
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Next we give another criterion for (X,g) to be complete. We start 
with a lemma. 


Lemma 6.8. Let f: Y— X be a C' map between Riemannian manifolds 
(Y, h) and (X,g). Assume that there is a constant C > 0 such that for 
all ye Y and we T,Y we have 


ldf(y)wll, 2 Cllwila- 


If y: [a,b] > Y is a piecewise C! path in Y, then 


L(f oy) 2 CL(y). 
Proof. We have 


b b 
Lf oy) = | Ifo WOlly at = | laf@)y' Ol, at 


b 
| Clly'Olla at 


a 


= CL,(y), 
as was to be shown. 


Let f: YX be a C' map of manifolds. We say that f has the 
unique path lifting property if given a point x e¢ X, a piecewise C! path a 
in X starting from x, and a point ye Y such that f(y) =x, then there 
exists a unique piecewise C' path y in Y such that foy =a and y starts 
at y. 


Theorem 6.9. Let f: YX be a local C* isomorphism of a Riemannian 
manifold (Y,h) into a Riemannian manifold (X, g). Assume that (Y, h) is 
complete, and X is connected. Also assume that there is a constant 
C > 0 suh that for all ye Y and we T,Y we have 


ldf(y)wll, = Cllwlln- 


Then f is surjective, f is a covering and has the unique path lifting 
property, and (X, g) is complete. 


Proof. The proof is in three steps. The first step is to prove that f is 
surjective and has the unique path lifting property. Let xe X, x = f(y). 
Every point in X can be joined to x by a piecewise C! path. Let 
a: [a,b] X be such a path, joining a(a) = x with «(b). We shall prove 
that « can be lifted uniquely to a path in Y starting from y. This will 
prove the first step. Let S be the set of elements t € [a,b] such that the 
path « restricted to [0, t] can be lifted uniquely to a path y» starting at y. 
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Without loss of generality, we may assume that a <b. The set S is not 
empty because ae S, and it is open because f is a local isomorphism. So 
it remains to show that S is closed. Let {t,} be a sequence in S in- 
creasing to the least upper bound by of S. Then {a(t,)} converges to 
a(bo), and by Lemma 6.8 the lengths of the lifted path between y(t,) and 
y(t,,) tend to 0 as m, n tend to infinity, so the sequence {y(t,)} is Cauchy 
in Y, converging to some element yo since Y is assumed complete. Then 
f(¥o) = aby), so S is closed, whence S = X by assumption. Therefore f 
is surjective, and we have also proved the existence and uniqueness of 
path liftings. 

The next step in the proof is to reduce the theorem to the case when the 
map f is a local isometry. We do this as follows. Let g* = f*(g) be the 
pull-back of the metric g from X to Y by f. Then for all ye Y and 
w € T,Y we have 


IIwllge = laf(y)wll, 2 Cllwila- 


Hence on Y we find that dist,, 2 C dist,. We now claim that Y is 
complete for the distance dist,.. To see this, first observe that if {y,} is 
g*-Cauchy, then {y,} is also h-Cauchy, so {y,} is h-convergent to an 
element y)e¢ Y. Then {f(y,)} converges to f(y). But f induces an iso- 
morphism from some neighborhood V of yo to an open neighborhood of 
f(¥o), and hence for all but a finite number of n, the points f(y,) lie in 
S(V), so {y,} is also g*-convergent to yo since g* = f*(g). This proves 
that Y is g*-complete. Furthermore, we have the inequality 


distye(V1, Y2) = dist,(f(y1), f(v2)) = for all yy, yr e ¥. 


In this final step, we prove that f is a covering. Since Y is g*-complete, 
this will also prove that (X, g) is complete, and will conclude the proof of 
the theorem. By the second step, we may assume without loss of general- 
ity that f is a local isometry, and that 


(*) dist,(y1, Y2) 2 dist,(f(y1), f(y2)) for all y,, y.€Y. 


Let x e X. From Theorem 6.4 we know that 
exp,: B,(0,, r) > B,(x, r) 


is an isomorphism for all r sufficiently small, say r<c with c>0. Let 
ye f +(x). Since f is a local isometry, the following diagram is commu- 
tative (using (*)): 


B,0,,7) 2. B,0,.) 


exp, exp, 


B,(y, 1) spire B,(x, r) 
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Note that the right vertical arrow is a differential isomorphism because 
we have picked r small enough, but so far we have made no such 
assertion for the left vertical arrow. For the proof of the theorem, it will 
suffice to show that f~'B,(x,r) is the disjoint union of the balls B,(y, r) 
for ye f +(x), if r is taken small enough. We take r so small that given 
x’ € B,(x,r) there is a unique geodesic in B,(x,r) joining x to x’ (namely 
exp,(tv) for some v). Then, first, we have f(B,(y,7)) < B,(x,1r), so the 
union is contained in f~'B,(x,r). Conversely, given a point ze f~'B,(x, r), 
we can join f(z) to x by a geodesic of length <r in B,(x,r), and by the 
path lifting property already proved in step 1, we can join z to a point y 
in f~1(x) by a geodesic of the same length, so 


‘ie (B(x, r)) = U B,(y; r), 


where the union is taken over y € f~‘(x). Finally, let y,, y. € f~'(x) and 
suppose y, # y,. We claim that B,(y,,r) is disjoint from B,(y2,7). Sup- 
pose there is some point z in the intersection. Then z can be joined to 
y, by a geodesic «, in B,(y,,r), and z can also be joined to y, by a 
geodesic a, in B,(y2,r), and these geodesics are distinct. Their images 
under f are geodesics in B,(x, r) joining x with f(z). By the uniqueness of 
path lifting, this would mean we have two distinct geodesics in B,(x, r) 
joining x and z, and that these geodesics have length <r. This contra- 
dicts the local uniqueness statement, and proves that the balls B,(y,, 7) 
and B,(y2,1r) are disjoint. This concludes the proof of the theorem. 


Remark. In the next chapter, under a condition of seminegative curva- 
ture (to be defined), we shall take Y = T,X, and we shall prove that 


f=exp,: LX ~X 


satisfies the hypotheses of Theorem 6.9, and therefore in particular that 
geodesic completeness implies completeness. In this manner, we shall be 
able to replace the local compactness condition by a curvature condition 
to insure the equivalence between the two notions of completeness. The 
whole technique goes back to Hadamard [Ha 1898] in the case of sur- 
faces with seminegative curvature, and Cartan [Ca 28] in the general 
case, still in this context of seminegative curvature. The notion of a 
“covering space” was not so clear during this early period. Except for a 
minor variation, the theorem is apparently due to Ambrose [Am 56], and 
occurs in the standard treatments of differential geometry as in [He 62] 
later replaced by [He 78], Chapter I, Lemma 13.4; [KoN 63], Chapter 
IV, Theorem 4.6 and Chapter VIII, §8, Theorem 8.1 and especially Lemma 
1. The theorem is at the base of the Cartan—Hadamard theorem, to be 
proved later. 


CHAPTER IX 


Curvature 


This chapter is a continuation of the preceding one, and is concerned 
with the iteration of covariant derivatives, from a formal point of view, 
and also from the point of view of their effect on the geometry of the 
manifold. 


IX, §1. THE RIEMANN TENSOR 


Let X be a manifold with a spray, and the covariant derivative D asso- 
ciated with the spray. If €, yn, € are vector fields on X, we are concerned 
with the operator 


which is a linear map of TX into itself. 


Proposition 1.1. There exists a unique tensor field R, section of 
L3(TX, TX), i.e. arising from the functor E+> L3(E, E) (continuous tri- 
linear maps of E into itself) such that for all vector fields &, yn, € we 
have 


R(é, n, €) = D-D,o — D, Deo — Drz,mS- 


Proof. The expression on the right-hand side gives a well-defined 
vector field on X. To show that this association comes from a tensor 
field, we can compute in a chart. To do this, we use the local expression 
for the covariant derivative given in Theorem 2.1 of Chapter VIII. So for 
the rest of the argument, €, 7, ¢ stand for €,, my, Cy in a chart U. Then, 


226 CURVATURE (IX, §1] 


for example, we have 
(1) D,¢ = ¢'-4 — B(n, ¢). 


We determine D.(D,¢) by substitution in this formula. As a first step, we 
have to write down the derivative 


(D,oy°o =O" en + Con’ & — Biy’C, C) — Bin, CC) — BUD, E, ¢). 
Then it follows that 


DADC) = CE + Con E = B'S 0) — Bay, C8) — Bn, &) 
+ B(B(n, 0), €). 


Permuting & and y gives us the second term. Using the local expression 
for the bracket 


[E.nl=n'-o—C-n 


as well as (1) will give us the third term. The reader will then verify that 
all the expressions containing a derivative cancel, leaving only bilinear 
expressions involving B (possible repeated), €, 4, and ¢. This proves 
Proposition 1.1. 


In addition, after the cancellation of the terms with derivatives, we 
obtain a local expression for R, namely: 


Proposition 1.2. Letting €, 7, € represent vector fields in a chart: 


Remark. There is no universal convention as to the sign of R. I use 
the same sign as [KoN 63], [ChE 75], [He 78], and [BGV 92], but the 
opposite sign to [BGM 71], [HGL 87/93], and [Mi 63]. For further 
comments, see the discussion after the definition of sectional curvature. 


Let v, w, ze T,X. It is customary to write 
R(v, w, 2) = R(v, w)z = Rx (E(x), n(x), $0) 
= R(E, n, C)(x), 
if €, 7, € are any vector fields such that &(x) = v, n(x) = w, C(x) = z. One 


writes 
R(é, 9): TTX ~ TTX 
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for the linear map of TX into itself, given by 
R(G, ) = DeD, — D, Dz — Dez.m- 


As a function of two variables, according to this definition, one may view 
R as a section of the bundle L?(TX, L(TX, TX)), which is formed by 
applying the functor E* +> L?(E, L(E, E)) to the tangent bundle. 

Next we list some identities. 


Proposition 1.3. 
R(v, w) = —R(w, v)  (skew-symmetry). 
R(v, w, z) + R(w, z, v) + R(z, v, w) =0 (cyclicity, Bianchi’s identity). 


Proof. The first relation is obvious from the definition. The second 
one is immediate from the local representation of Proposition 1.2. 


For the next two properties, we assume that the spray is the one 
associated with a metric, so the covariant derivative is the metric deriva- 
tive. We let < , >, be the scalar product associated with the metric. 
Then we define a function of four variables 


R(v, w, 2, u) = (Riv, w)z, u), for v, w, z, ue T,X. 


Then R is a tensor of type L*, that is a section of L*(TX) = L*(TX, R). 
We shall call R the Riemann 4-tensor (canonical with respect to g). We 
call —R the curvature tensor. The properties of Proposition 1.3 may be 
formulated for this 4-tensor, and we shall see in a moment that it also 
satisfies two other important properties. Thus it is useful to make a 
general definition. A tensor R of type L* is called a tensor of Riemann 
type if it satisfies the following four properties: 


RIEM 1. R(v, w, z,u) = —R(w, v, z, u) 

RIEM 2. R(v, w, z,u) = —R(v, w, u, Z) 

RIEM 3. R(v, w, z,u) + R(w, z, v, vu) + R(z, v, w, u) = 0 

RIEM 4. R(v, w, z, u) = R(z, u, v, w). 
The first two conditions express the property of being alternating in the 
first two variables, and also in the last two variables. The third condi- 
tion is called the Bianchi identity, and expresses the property that the 


cyclic symmetrization of the tensor is 0. The fourth property states that 
the tensor is symmetric in the pairs of variables (v,w) and (z,u). In 
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particular, we note right away that from RIEM 4, we obtain: 
R(v, w, v, u) is symmetric in (w, u), that is R(v, w, v, u) = R(v, u, v, w). 


We shall make more comments on these properties after the next propo- 
sition, which justifies the terminology. 


Proposition 1.4. On a pseudo Riemannian manifold, the Riemann tensor 
satisfies all the above four properties. Furthermore, RIEM 4 follows 
from RIEM 1, 2, 3. 


Proof. Properties RIEM 1 and RIEM 3 have been proved in Propo- 
sition, 1.3. Property RIEM_2_amounts to proving that R(v, w, z,z) = 0 
for all v, w, z; or in terms of vector fields, R(é, y, ¢,¢) = 0. We will need 
to differentiate. Since all the terms with derivatives vanish in the local 
formula of Proposition 1.2, we may assume without loss of generality 
that [€,74] =0. Then 


CRE, nS, Og = «DD, — D,Dzf, 0g, 


and we must show that the right side is symmetric in €, yn. But [é,7] =0 
implies that 
LL KE OD, 


is symmetric in €, 4. Since we are dealing with the metric covariant 
derivative, it follows that 


RS; Os = 2<D,¢, os 


and therefore 


LL,86; ae = 2<D:D,¢, Cy + 2<Df, DOs 


from which it follows at once that <D:D,¢, o>, is symmetric in ¢, y, thus 
proving RIEM 2. 

The formula RIEM 4 is a formal consequence of the preceding three 
formulas. It is basically an exercise in algebra, which we carry out. In 
the cyclic identity RIEM 3, interchange u with z, v, w successively, and 
add the resulting three relations. One gets, using RIEM 1 and RIEM 3: 


(*) R(u, v, w, z) + R(u, w, z, v) + R(u, z, v, w) = 0. 
From cyclicity and RIEM 1, one gets 


R(z, v, u, w) = R(u, v, z, w) — R(u, z,v,w) or 


R(u, z, v, w) = R(u, v, z, w) — R(z, v, u, w). 


[IX, §1] THE RIEMANN TENSOR 229 


We substitute the value on the left in (*), and use RIEM 1 to conclude 
the proof of RIEM 4. 


We shall be dealing with a contraction of the canonical 4-tensor. We 
defined the canonical 2-tensor R, by 


R,(v, w) = R(v, w, v, w). 


Proposition 1.5. The canonical 2-tensor determines the Riemann tensor. 
Or similarly, if the canonical tensor R satisfies 


R(v, w, v, w) = 0 for all v, w, 
then R= 0. 


Proof. Say we prove the second assertion first. From RIEM 4, which 
implies that R(v, w, v, z) is symmetric in (w, z), if R(v, w, v, w) = 0 for all v, 
w then R(v, w, v, z) = 0 for all v, w, z. From the alternating properties of 
RIEM 1 and RIEM 2, it follows that R = 0 identically. 

To show that the canonical 2-tensor determines the Riemann tensor, 
we note that the problem is essentially equivalent to the other statement, 
but one may argue directly as when one recovers a symmetric bilinear 
form from a quadratic form, namely 


02 
=~ R(v + tz, w + su,v + tz, w + su 
E Us My 


e2 
~ Otds 
= 6R(v, w, Z, u). 


R(v + tu, w + sz,v + tu,w + s3| 


s=t=0 


This proves the proposition. 


An important case arises when R,20. We define (X,g) to have 
seminegative curvature if R, 20. The following discussion explains this 
terminology in terms of its historical development. 


Curvature discussion 


A large part of the theory we are developing is fundamentally a theory of 
commutative rings with certain types of derivation, and possibly scalar 
products, in which positivity or negativity plays no role. This theory 
contains a number of formulas with precise equality between various 
terms. There would be some value in redoing this chapter and the 
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preceding one completely in such a context of commutative differential 
algebra. At some point, for certain applications, the positivity or negativ- 
ity properties of the real numbers are used, as in the second statement of 
Proposition 2.6 below. For such applications, the question arises as to 
what is the natural sign to be used, if indeed there is a natural sign. 

Historically, the theory arose in a geometric context, based on geomet- 
ric intuition. To each pair of vectors (v, w) in a tangent space T,X, we 
define the area square of the parallelogram spanned by these vectors to 
be 


Ar,(v, w)? = v?w? — <v, wy?. 


As usual, v7 = <v, v),. Then, when Ar,(v, w)? #0, we define the sectional 
curvature to be 
R(v, w, v, w) 


Sec,(v, w) = “Ar, (0, w)? | 
g\"> 


In the Riemannian case, Ar,(v,w) #0 if and only if v, w are linearly 
independent. If v? and w? > 0, then the value on the right depends only 
on the unit vectors in the direction of v, w respectively; and if v, w are 
orthogonal unit vectors, then 


Sec,(v, w) = — R(v, w, v, w). 


In the Riemannian case, it is immediate that the value of the sectional 
curvature on (v,w) depends only on the plane generated by v and w, 
because of the skew-symmetry of RIEM 1 and RIEM 2. For the com- 
plex analogue, see [La 87], Chapter V, §3. 

Let ce R* be a positive number. The multiple cg is called a scaling of 
the metric g. Since the covariant derivative D® is the same as D%, it 
follows from the definitions that under scaling, the curvature changes as 


Sec,, = c~' Sec,. 
Directly from the definition, we then see in the Riemannian case that: 
The sectional curvature has constant value —1 if and only if 
R,(v, w) = v?w? — <v,w>? for all v, we T,X. 


Viewing v as fixed, the above expression is quadratic in w, and the 
corresponding symmetric bilinear form is 


R(v, w, v, z) = v?* Cw, 2, — Cv, W), <0, 2,9. 
Thus R(v, w)v is given by 


R(v, wv = v?w —<v,w),v for all v, we T,X. 
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Similarly, the sectional curvature has constant value +1 if and only if 
the analogous formula holds with a minus sign inserted on one side, so that 
for instance 


— R(v, w)v = v?w — Cv, W)yv. 


In the applications of this book (the rest of this chapter, the Cartan— 
Hadamard theorem, the variation formulas of §4, etc.) what matters is 
not the “curvature” as defined above, but the canonical tensor R itself. 
Furthermore, formulas in these applications come out much neater with 
R than with “curvature” for two reasons: 

First, for such formulas, dividing to normalize as in the curvature 
quotient is unnatural, partly because the term by which one divides, 
algebraically, may be equal to 0 unless extra conditions are imposed. 

Second, even for inequalities as distinguished from equalities, the natu- 
ral condition which arises is R(v, w, v, w) 20 rather than curvature < 0. 
If one takes R with the sign as we have defined it, then only plus signs 
occur in all the formulas (cf. Lemma 2.5 and the variation formula, 
Theorem 4.3, for instance), This universal occurrence of plus signs is 
obscured if one introduces minus signs artificially. I regard this universal 
occurrence of plus signs as structurally important. 

The naturality of R in the real case is similar to the naturality of its 
counterpart in the complex case, where formulas involving positivity come 
out neatly by using the analogue of R rather than its negative (as already 
noted by Griffiths). Cf. [La 87], the comments pp. 136-137 about holo- 
morphic sectional curvature. The lesson is that the “curvature” in classi- 
cal terminology is minus the natural object R (aside from questions of 
normalizing the dilation to the unit sphere). 

Classically, starting with surface theory, people wanted some formulas 
such as Gauss-Bonnet or formulas relating “curvature” and Betti num- 
bers, using +R, to come out so that on the sphere, one gets a value of 
certain integral to be 4x and not —4z. So they picked the minus sign, 
and gave the notion —R (normalized) the name of curvature, which 
makes the sphere have positive curvature. The bottom line is that de- 
pending on what applications one makes, both R and —R are “natural.” 
However, from the point of view of universal algebraic manipulations, R 
is the clearest functorial notion. 

One can define two other curvatures, at least. Actually, all we need is 
a tensor of curvature type. From such a tensor R, we obtain two other 
tensors. First observe that to each pair of vectors v, ze E we can asso- 
ciate an endomorphism of E, denoted by Ric(v, z), and defined by 


Rica(v, z)w = R(v, w)z. 
Thus Ric gives a bilinear map 


Ricg: E x E- L({E, EB). 
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Applied to the tangent bundle, and the Riemann tensor R itself, Ric is 
called the Ricci tensor. 
Furthermore, in the finite dimensional case, the trace 


tr: L(E, E) > R 
is a continuous linear map. Then the composite 
Scr =tro Ricp 


is a function of pairs of vectors, which when applied to the tangent 
bundle defines what is called the scalar curvature. In the infinite dimen- 
sional case, one has to give an additional structure, assuming that the 
Ricci tensor is of “trace class”, or defining the sectional curvature with 
respect to a given “trace,” i.e. a continuous functional on L(E, E) which is 
equal on products AB and BA. But this now leads far afield. 

Suppose we are in the Riemannian case. We can then give an explicit 
formula for the scalar curvature. In the neighborhood of a point, we can 
find vector fields €,, ...,¢, (with n = dim X) which are orthonormal, by 
the usual orthogonalization process. Such a sequence of vector fields is 
called an orthonormal frame at the point. 


Proposition 1.6. Let {&,,...,¢,} be an orthonormal frame on an open 
set. Then for vector fields —, n we have 


Sea(E.m) = RUE fi. 80) 


Proof. This is immediate from the definition of the trace of an endo- 
morphism of a finite dimensional vector space. 


The conditions RIEM 1 and RIEM 2 express the property of de- 
pending only on the wedge product of each pair of variables v ~ w and 
z Au. Property RIEM 4 is a symmetric property in these pairs of vari- 
ables. Thus we may say that the four-variable tensor R defines a sym- 
metric bilinear form on /\? TX, which we denote by 


R*: 2 TX a A?TX OR, such that R*(v A w,z A u) = R(v, w, z, uv). 


On the other hand, we also have the pseudo Riemannian metric, which 
induces a non-singular scalar product on /\? TX by the formula 


C0, Z), <0, UD, 
CW, Z)g <W, UY, 


<v A w,Z A uy, = det 
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These scalar products are of course evaluated at each point x eX, v, w, 
z, ue T,X. The scalar product on /\? TX with respect to the non- 
singular symmetric form g then corresponds to a symmetric operator 
which is called the curvature operator. 

In the infinite dimensional case, from the self duality, each tangent 
space can be interpreted as the dual of its dual, and the wedge product is 
defined as in Chapter V, §3 so the above notions still make sense. 

Readers wanting to pursue the topic of curvature are now referred to 
other books on differential geometry, including [BGM 71], [ChE 75], 
[doC 92], and [GHL 87/93]. 


IX, §2. JACOBI LIFTS 


Let (X,g) be pseudo Riemannian. We write w? for <w, W),, and v lw 
for <v,w>, = 0. We let a: [a,b]— X be a geodesic. Unless otherwised 
specified, (X, g) is not necessarily Riemannian. 


A lift 7 € Lift(«) to the tangent bundle will be called a Jacobi lift, or 
more classically a Jacobi field, if it satisfies the Jacobi differential equation 


Dzn = R(w, no’. 


Theorem 3.1 of Chapter VIII and Proposition 1.2 in the preceding sec- 
tion of the present chapter show that locally, the above equation is a 
linear differential equation. Therefore, by the existence and uniqueness 
theorem for linear differential equations, we get: 


Theorem 2.1. Let (X,g) be pseudo Riemannian, let a: [a,b] X be a 
geodesic. Given vectors z, Ww € Tyq)X, there exists a unique Jacobi lift 
N=Nz,y of «to TX such that 


n(a) =z and D,-n(a) = w. 


In particular, the set of Jacobi lifts of « is a vector space linearly 
isomorphic to Tyq) X Tya) under the map (z, W) +> Nz. 


We denote the space of Jacobi lifts of « by Jac(a). 
Let v € T, and consider the unique geodesic 


a(t) = exp,(tv) 


such that «(0) = 0 and a’(0) = v, with « defined on an open interval. Let 
we T,X and let y,, be the unique Jacobi lift of « such that 


n(0)=0 and — Dyn,,(0) = w. 
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Example. Let w = v. Then 
no(t) = ta’(t). 


Proof. One verifies at once that 7,(0) = 0, and since D,,« = 0, we also 
have 
Dyn (t)=a(t) and D2n, =0= R(a’, a’)a’. 


Remark 1. Defining a Jacobi lift implicitly has the geodesic a in its 
definition. If, say, a=0, this geodesic is uniquely determined by its 
initial condition «'(0) = v, so the Jacobi lift is also determined by v. Thus 
one could write yn‘) for the Jacobi lift. In the present discussions, this 
won’t be necessary since we deal systematically with a fixed a. 


Remark 2. In a chart, the derivative 7'(0) can be computed in the 
naive way since 4: J — TX is defined on an interval. The naive deriva- 
tive and the covariant derivative D,-y differ locally in a chart by a term 
linear in 7, which therefore vanishes at 0 since n(0) = 0. Hence the naive 
derivative and the covariant derivative have the same value at 0, that is 


n'(0) = Dy-n(0). 


We note that, « being fixed, the association w+ n,, is linear. We now 
have the possibility of orthogonalization. 


Proposition 2.2. Let (X,g) be pseudo Riemannian. Let «: [a,b] X be 
a geodesic, and let n be a Jacobi lift of «. Then there are numbers c, d 
such that 

<n, &'>,(t) = c(t — a) +d. 


In fact, d= <n, «'),(a) and c= <D,n, «'>,(a). If n(a) and D,n(a) are 
orthogonal to «'(a), then n(t) is orthogonal to o(t) for all t. 


Proof. Using the metric derivative, and D,.«’ = 0 since « is a geodesic, 
we find that 0<n, a’>, = <D,n, a >,, and then 


a <n, w >, = <D2n, & >, = R(a’, n, a’, a’) = 0. 


Hence <y,«’>, is a linear function, whose coefficients are immediately 
determined to be those written down in the proposition. 


Proposition 2.3. As above, let «'(0) =v. Write w= cv + w, with <w,, VY, 
= 0. Then n,, has the decomposition 


Nw = Cy + Nw, also written n,,(t) = cta’(t) + n,,(t). 


Furthermore n,,, is orthogonal to «’, that is Nw,» © Dg = 0. 
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Proof. Immediate from Proposition 2.2. 
Next we get the similar orthogonalization of D,-n,,. 


Proposition 2.4. Notation as in Proposition 2.3, we have an orthogonal 
decomposition 


Dy Nw = CDyNy + DyNw, also written Dy Ny(t) = ca’(t) + Dy ny, (0)- 


In other words, if w,L«'(0), then D,.n,, -L«'. Furthermore (DyNy)* is 
constant. 


Proof. For the first assertion, we take the derivative and use Proposi- 
tion 2.3 to get 


0 = OXNw,> a>, = CDi Nw, > at’ >4. 
For the second, we then obtain for 7 = n,: 


0<Dyn, Dy, = 2<Dzn, Dy, 
= 2¢R(a’, na’, Dy n),- 


If ny =n, so n,(t) = ta’(t), then the right side is 0 because R, is alternating 
in its last two variables. If 7 = n,, with w La’(0), then by the first asser- 
tion, D,-y is orthogonal to a’, so the right side is also 0. This concludes 
the proof of Proposition 2.4. 


The next lemma will give us information on the rate of growth of a 
Jacobi lift, and the convexity of its square. 


Lemma 2.5. Assume (X,g) Riemannian. Let yn be a Jacobi lift of «a. 
Let f(t) = |ln(0)||. Then at those values of t >0 such that n(t) #0, we 
have 


1 
f" = 3((Den)?n? — (Dyn, mg) + 


— Tale R,(o, ). 


1 
\Inl| 
Proof. Straightforward calculus, using the covariant derivative. The 
first derivative f’ is given by 


f' =(n?)*? <n, Dyn, = — <n, Dy), 


1 
\Inl 
Then f” is computed by using the rule for the derivative of a product. 
In the term containing <D2n, Ng, we replace D2n by R(a’,n)a’ (using the 
definition of a Jacobi lift) to conclude the proof of the lemma. 
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In the above lemma, we note that on the right side, the first term is 
= 0, and the second term is = 0 if R, = 0. 


Proposition 2.6. Let «:[0,b]— X be a geodesic. Let w€ Tyo,X, w #0. 
Let ny =o, = be the unique Jacobi lift satisfying 


Nyw(0)=0 and = Dyn (0) = w. 


If (X,g) is Riemannian and R, = 0 (so (X,g) has seminegative curva- 
ture), then for te [0, b] we have 


IIn(t)|| 2 ||wllt and in particular —_||n(1)|| 2 wll if b = 1. 


Proof. Let h(t) = |/n(t)\| — ||wllt for OS t<b. Then h is continuous, 
h(O) = 0, and by Lemma 2.5, h” = f" = 0 whenever n(t) #0. One cannot 
have n(t) = 0 for arbitrarily small values of t 4 0, otherwise D,.y(0) would 
be 0 (because in a chart U, (0) = D,-n(0)). In fact, we shall prove that 
there is no value of t #0 such that y(t)=0. Suppose there is such a 
value, and let ty be the smallest value > 0. In the interval (0,t9) we 
have h” = 0 by Lemma 2.5, so h’ is increasing. But the beginning of the 
Taylor expansion of y in a chart is 


ny(t) = wt + O(t?), so lim f(t) = Iw]. 
t-0 


Furthermore, h’(0) exists and is equal to 0, so h'2=0 on [0, tg), so h is 
increasing, and there cannot be a value ty) > 0 with n(ty)=0. Then the 
above argument applies on the whole interval [0, b] to prove the desired 
inequality on the whole interval. This concludes the proof of Proposition 
2.6. 


Remark. These results essentially stem from Cartan [Ca 28]. The 
above version without coordinates, which extends to the infinite dimen- 
sional case, comes from [BiC 64]. Readers may find it instructive 
to compare this version with the one involving coordinates given in 
[He 78], pp. 71-73. 


Proposition 2.6 is used for the subsequent application to the Cartan— 
Hadamard theorem (Theorem 3.7), based on Theorem 6.9 of Chapter 
VIII, whose origin is in Hadamard for surfaces [Ha 1898] and Cartan in 
general. (Here and at several other places, I rely on Helgason’s very 
useful bibliographical comments.) 


Variations of geodesics 
By a variation of a curve ~ one means a C? map 


o:[a,b]x Jax 
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where J is some interval containing 0, such that o(s, 0) = «(s) for all s. 
One then writes 
a(s, t) = a,(s), 


and one views {a,} as a family of curves defined on [a,b]. If all curves 
a, are geodesics for teJ then one says that o is a variation through 
geodesics. 


Lemma 2.7. Let a: J, x J, > X be a C* map. Then on lifts of o to the 
tangent bundle, we have the equality of operators 


D,D, — D,D, — R(Q,6, 0,0). 


Proof. The formula can be verified in a chart. It follows directly from 
the definitions, especially using the local expression of Proposition 1.2. 


Proposition 2.8. Let o: [a,b] x JX be a variation of a geodesic « 
through geodesics. Let 

n(s) = 0,a(s, 0). 
Then n is a Jacobi lift of «. 


Proof. Given o, we have 


D?0,06 = D,D,6,0 = D,D,0,6 by Lemma 5.3 of Chapter VIII 
= D,D,0,0 + R(0,6, 0,0)6,0 by Lemma 2.7. 


But D, 0,a(s, t) = 0 because «, is a geodesic for each t, whence 
Din = R(a’, no’, 
so y is a Jacobi lift of «, as was to be shown. 
We will not need the next result, but put it in for completeness. 


Theorem 2.9. Let «:[0,b]— X be a geodesic, parametrized by arc 
length. Let n be a Jacobi lift of «. Then there exists a variation 


o:[0, b] x (-g, 8) 9X 
through geodesics, such that 
n(s) = 0,0(s, 0) for all s. 
Proof. Let B:(—«, &)— X be the geodesic such that 


BO)=a(0) and = B(0) = n(0). 
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Let v = a'(0) and w = D,.n(0). Let 
C(t) = Pa (v + tw) = Pay(v) + tPo(w), 


where Pz) is parallel translation along £. Then from the definition of 
parallel translation, 


€(0) = v and D,.C(0) = w = D, n(0). 
Let 
G(S, t) = €xpg SC(t). 


We have oa(s, 0) = a(s), and for each t, the curve s +> a(s, t) is a geodesic, 
so o is a variation of « through geodesics. We still have to show that 
n(s) = d,0(s, 0). Let &(s) = d,0(s, 0). Then & is a Jacobi lift of «, so it 
suffices to prove that € has the same initial conditions as 7. We have 
first 

£(0) = 0,0(0, 0) = n(0); 
and in addition, 


D, €(0) = D,é,0(0, 0) = D,0,0(0, 0) by Chapter VIII, Lemma 5.3 
= (Dy-d €XPp(o)(0)C )(0) 
= D,C(0) 
= D,7(0), 
thus concluding the proof. 

Example. Constant curvature. Let (X,g) be Riemannian. As an exam- 
ple, we shall now determine more explicitly the Jacobi lifts when (X, g) 
has constant curvature. Since the covariant derivative is invariant under 
a scaling of the metric, we may as well assume that the curvature is 0 or 


+1. In the next three proposition, we let x ¢ X and we let ve T,X be a 
unit vector. As usual, we let a = «, be the geodesic 


a(t) = exp,(tv). 


For we T,X we let n,, =n be the Jacobi lift of «, satisfying the usual 
initial conditions 


ny(0)=0 and  D,1,,(0) = w. 
Finally, we let y,, = y(” be parallel translation of w along «a,, so 


yw0)=w and  D,y7, = 0. 
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Proposition 2.10. Assume that the curvature is 0, or equivalently that 
the Riemann tensor R is identically 0. Then for all we T;.X we have 


Nw(t) = tyy(t). 


Proof. The two curves t+> 7,,(t) and t+ ty,(t) have the same initial 
conditions. Also they satisfy the same differential equation, namely 


Din, =0 and Dz (ty,(t)) = 0. 
Hence they are equal, thereby proving the proposition. 


The next two propositions deal with constant curvature +1. We 
recall that we wrote down the Riemann tensor explicitly in those cases in 
§1. We may therefore write down the differential equation for a Jacobi 
lift more explicitly in those cases, as follows. 


Proposition 2.11. Assume that (X,g) has constant curvature —1. Then 
the Jacobi differential equation has the form 

(1) Deny = Nw — <Nws a g0t’. 

Furthermore, if we orthogonalize w with respect to v, so write 


W=Co0+cCyU with Co, Cc, € R and a unit vector u 1 v, 


then 


(2) Nw(t) = Cota’(t) + (sinh t)c, ,(2). 


Proof. The orthogonalization of Jacobi lifts comes from Proposition 
2.3, so we want to identify the orthogonal components of the Jacobi lift 
of a, with scalar multiples of parallel translation. It suffices to do so 
when w =v and w=uLv separately. The example following Theorem 
2.1 already gives us the v-component, so we may assume w= u. In this 
case, the reader will verify that the two curves 


tr 7,(t) and t + (sinh f)y,(t) 


have the same initial conditions at 0 (for their value, and the value of 
their first covariant derivative). They also satisfy the same differential 
equation, namely 

Diy = Nw 


and similarly for the other curve, since D,-y, = 0. Hence the two curves 
are equal, as was to be shown. 


240 CURVATURE [IX, §3] 


Thirdly we deal with constant positive curvature. 


Proposition 2.12. Assume X has constant curvature +1. Let xe X. 
Then the same formulas hold as in Proposition 2.11, except for a minus 
sign on one side in formula (1), and with sinht replaced by sint in 
formula (2). 


Proof. The arguments are the same. Using sin t instead of sinh t just 
guarantees that the differential equation 


D2 N= Nu 
is satisfied, with the minus sign. 
This concludes our analysis of the Jacobi lifts in the cases of constant 
curvature. 
The Jacobi differential equation has at least two main aspects. One of 
them will be applied to a study of the differential of the exponential map 


in the next section. The other will be applied to variational questions 
in §4. 


IX, §3. APPLICATION OF JACOBI LIFTS TO dexp, 


We continue to assume that (X,g) is pseudo Riemannian, unless other- 
wise specified. 


We are interested in Jacobi lifts because they give precise information 
concerning the differential of the exponential map, for instance as in the 


following result. In the statement, if ve T, then we identify T, 7, with T., 
as we usually do for a Banach space. 


Theorem 3.1. Let xe X and ve T,. Let « (defined on an open interval 
containing 0) be the geodesic such that a(0)=x and a (0)=v. Let 
we T, and let n,, =o, be the Jacobi lift of « such that 


Nw(0) ae 0 and DyNw(0) =W. 


Then for r > 0, in the interval of definition of «, we have the formula 


dexp,(rv)w = "al 


In particular, w lies in the kernel of dexp,(rv) if and only if n,(r) = 0. 
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Proof. Let 
a(s, t) = exp,(s(v + tw)). 


Then a, is a geodesic for each t, and oo(s) = exp,(sv) = a(s), so ao is a 
variation of « through geodesics. Let n(s) = d,0(s, 0). Then 7 is a Jacobi 
lift of a by Proposition 2.8. Let f(s, t) = s(v + tw). Then 


d,0(s, t) = (dexp,)(S(s, 1)) 6f/6t) = (dexp,)(S(s, £))(sw). 


Hence 7(0) = 0. Furthermore this same expression yields the formula of 
the theorem, namely 


n(r) = (dexp,)(f(r, 0))rw = (dexp,)(rv)rw. 


Taking the limit as r>0 in the formula, noting that in a chart 
D,-n(0) = 7/(0), and using dexp,(0) = id proves that D,4(0) = w and con- 
cludes the proof of Proposition 3.1. 


The Jacobi lifts also allow us to give a more global version of the 
orthogonality property as in the Gauss lemma of Chapter VIII, Theorem 
5.6. 


Proposition 3.2 (Gauss Lemma, Global). Let (X,g) be pseudo Rieman- 
nian. Let x € X and ve T,X. Let the exponential map r +> exp,(rv) be 
defined on an open interval J. Then for all we T,X we have 


<dexp,(rv)v, dexp,(rv)w>, = (0, W),. 


Proof. Immediate from Theorem 3.1 and the orthogonalization of 
Proposition 2.3. 


Variation of a geodesic at its end point 


Next we shall give another way of constructing Jacobi lifts, which will 
not be used until Chapter X, Proposition 2.5. Readers interested in 
seeing at once the application of Jacobi lifts to the Cartan—Hadamard 
theorem, say, may omit the following construction. 


Let x, ye X with x #y be points such that y lies in the exponential 
image of a ball centered at 0, in T,X and such that the exponential map 
exp, is an isomorphism on this ball. 


Thus this ball provides a normal chart at x. Let a be the geodesic 
parametrized by arc length joining x to y, so there is a unit vector ue T, 
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such that 
a(s) = exp,(su) and y = exp,(ru) for some r>0. 


Thus dist,(x, y)=r. Let e be a unit vector in T,X, and let B be the 
geodesic such that 


B(O)=y (so B starts at y) and B'(0) =e. 


We consider an interval of the variable t such that f(t) is contained in 
the image of the previous ball around x. For each t we let a, be the 
unique geodesic from x to f(t), parametrized by arc length. Then {a,} is 
a variation of «, namely %) =, and it is a variation through geodesics, 
illustrated on the next figure, drawn when e is perpendicular to «’(r) to 
illustrate Proposition 3.3. 


B(t) 


x 


The above variation will be called the variation of « at its end point, in 
the direction of e. 


Proposition 3.3. Let y = exp,(ru) be in a normal chart at x as above, 
with the unit vector u. Let «(s) = exp,(su), and let {a,} be the variation 
of « at its end point y in the direction of the unit vector ee T,X. Also 
denote this variation by o, and let n(s) = 0,0(s, 0). Assume that e is 
orthogonal to a'(r). Then D,1 is orthogonal to a’, and n is the unique 
Jacobi lift of « such that 


n(0) = 0 and n(r) =e. 


Proof. First note the uniqueness. If there is another Jacobi lift having 
the last stated property, then the difference vanishes at 0 and r, and by 
Theorem 3.1 this difference must be 0 since the exponential map is 
assumed to be an isomorphism from a ball to its image, which contains 
y = exp(ru). 

Next, the variation o is given by the formula 


a(s, t) = a,(s) = exp,(su(t)) such that —_ exp, (s(t)u(t)) = B(O), 


[IX, §3] APPLICATION OF JACOBI LIFTS TO dexp, 243 


where u(t) is a unit vector, and s(t)u(t) is the vector whose exponential is 
B(t). The polar coordinates s(t) and u(t) depend as smoothly on t as the 
exponential map, or its inverse. Then 


620(s, t) = dexp,(su(t))su'(t), 
so that (since u = u(0)), 
n(s) = dexp,(su)su’(0) 
= Nu(oy(S)s 


because from Theorem 3.1, we see that D,.n(0) = u’(0). Since u(t)? = 1, it 
follows that u’(0) is perpendicular to «’(0) = u, so D,-n is orthogonal to a’. 
Furthermore 


B'(t) = dexp(s(t)u(t))(s(t)u'(t) + s‘()u(o), 
and since s(0) = r, we find 
e = B'(0) = dexp(ru)(ru’(0) + s‘(O)u) 
‘= dexp(ru)ru’(0) + dexp(ru)s’(Q)u. 


Since e is assumed orthogonal to «'(r) = dexp(ru)u, and u’(0) is also or- 
thogonal to u, we must have s’(0) = 0, whence the relation 


e = dexp(ru)ru’(0) or Nu) = e. 


This proves the proposition. 


Transpose of dexp, 


In the next results we are concerned with the differential of the exponen- 
tial map at arbitrary points, namely for ve T, such that exp, is defined 
on the segment [0, v], we are concerned with 


dexp,(v): T, > T,, where y = exp,(v), 


especially whether this map is an isomorphism, or what is its kernel. 
Theorem 3.1 describes a condition for an element w to be in the kernel 
in terms of a zero for a suitable Jacobi lift. We shall exploit this condi- 
tion to see that under some circumstances, there cannot be a non-trivial 
zero. We first give a lemma from Ambrose [Am 60]. 


Lemma 3.4. Let (X,g) be pseudo Riemannian. Let n, € be Jacobi lifts 
of a geodesic a. Then 


Dyn, Og — <n, Dab is constant. 
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Proof. We differentiate the above expression and expect to get 0. 
From the defining property of the covariant derivative, the derivative of 
the above expression is equal to 


«Den, > + (Dyn, Dyl> — «Dyn, Db) — <n, Del 
= (Din, 6> — (Del, 0). 
= R(o, UB a’, ¢) _ Ri’, G a, n) 
=0 
by the symmetry property of R. This proves the lemma. 


The next lemma, from McAlpin’s thesis [McA 65], describes the ad- 
joint of the differential of the exponential map. 


Lemma 3.5. Let (X,g) be pseudo Riemannian. Let « (defined at least 
on [0, 1]) be the geodesic such that a(0) = x and a'(0) = v. Let 


Z€ Tyas we Tyx0)> 
and let 
v* = —a'(1) = —P, 


where P is the parallel translation along a’. Then 


<dexpxo)(v) w, Za1) = <W; dexpy1)(0*)Z ao) 


Proof. Let € be the Jacobi lift of « such that €(1) = 0 and D,-€(1) = z. 
Let 4 be the Jacobi lift as in Proposition 3.1. Then 


<dexp,(v)w, z> = <n(1), Dy C(1)> = <Dyn(1), C(1)> + C = C, 
where C is the constant of Lemma 3.4. We compute C to be 
C = —<D,-n(0), €(0)> = —<w, 6(0)>. 


Let rev(a) be the reverse curve, so that rev(a)(t) = a(1 — t), and let € be 
the unique Jacobi lift of rev(a) such that 


€(0) =0 and Deeyiay' S(O) = 2. 


Then in fact &(t) = €(1 — t), and applying Proposition 3.1 concludes the 
proof. 
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Seminegative curvature 


We apply the above results to the case of seminegative curvature. The 
next proposition gives us a criterion for the kernel of the differential of 
the exponential to be trivial, and we use Jacobi lifts in the proof. 


Theorem 3.6. Let (X,g) be Riemannian. Assume (X, g) has seminegative 
curvature. Then for all xe X and ve T,, v #0, such that exp, is 
defined on the segment [0, v] in T,,, we have 


Iddexp,()w|ly = [lwp for all we T,X. 


In particular, 
Ker dexp,(v) = 0. 


Proof. Let n,, be the Jacobi lift as in Proposition 3.1, so that 
dexp,(v)w = n,,(1). 


The asserted inequality is then a special case of the inequality found 
in Proposition 2.6. This inequality implies that Ker dexp,(v) = 0, which 
concludes the proof. 


Observe that the estimate on the differential of the exponential states 
that the inverse dexp,(v)"! is bounded by 1, as a continuous linear map. 
Of course, so far, this inverse is defined only on the image of dexp,(v). 
In the finite dimensional case, invertibility is immediate. In the infinite 
dimensional case, it is in McAlpin’s thesis [McA 65], as follows. 


Theorem 3.7 (McAlpin [McA 65]). Let (X,g) be a Riemannian—Hilber- 
tian manifold with seminegative curvature, and let x € X. Assume that 
exp, is defined on all of T,, (what we called geodesically complete at x). 
Then for all v € T, the map dexp,(v) is a topological linear isomorphism, 
and in particular, exp, is a local isomorphism. 


Proof. We have already proved that dexp,(v) is injective and has a 
continuous inverse on its image. Lemma 3.5 shows that we can apply 
the same reasoning to the adjoint (dexp,(v))* = dexp,(v*) for y = exp,(v), 
so this adjoint also has kernel 0. Hence dexp,(v) is surjective, thereby 
concluding the proof of the theorem. 


The next theorem was proved by Hadamard for surfaces [Ha 1898], 
by Cartan for finite dimensional Riemannian manifolds [Ca 28], and by 
McAlpin in the Hilbertian case [McA 65]. 
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Theorem 3.8 (Cartan—Hadamard). Let (X, g) be a Riemannian manifold, 
connected, and such that exp, is defined on all of T, for some x € X (so 
geodesically complete. If R, 20 (i.e. X has seminegative curvature), 
then the exponential map exp,: T,X — X is a covering. In particular, if 
X is simply connected, then exp, is an isomorphism. 


Proof. We have already proved that exp, is a local isomorphism. 
There remains to prove that exp, is surjective, and that it is a covering. 
But all the work has been done, because we simply apply Theorem 6.9 of 
Chapter VIII with Y = T, having the given metric h = g(x), for which Y 
is certainly complete. Theorem 3.6 guarantees that the essential estimate 
hypothesis is satisfied, so the proof is complete. 


Corollary 3.9. Let (X,g) be a connected Riemannian manifold with 
seminegative curvature. Then (X,g) is complete if and only if the expo- 
nential map exp, is defined on all of T, for some x € X, and therefore 
for every xe X. 


Proof. That (X,g) complete implies exp, defined on all of T, was 
proved under all circumstances in Proposition 6.5 of Chapter VIII. The 
converse is now immediate from Theorem 2.10 and Theorem 6.9 of Chap- 
ter VIII. 


Corollary 3.10. Let (X,g) be a complete Riemannian manifold, simply 
connected, and with seminegative curvature. Let xe X. Then for all v, 
we T,X we have the inequality 


dist,(exp,(v), exp,(w)) 2 |lv — wll. 
Proof. By Theorem 3.7 the exponential map has an inverse 
go: X > T,X 
and by Theorem 3.6 this inverse satisfies 


lde(x)i, $1 


for all ze X, where the norm is that of a continuous linear map from 
T,X to Ty,)X, with their structures of Hilbert spaces due to g. The 
inequality of the corollary is then immediate from the definition of the 
length of curves. 


Corollary 3.11. Suppose that (X,g) is Riemannian, complete, with semi- 
negative curvature and simply connected. Then any two points can be 
joined by a unique geodesic whose length is the g-distance between the 
two points. 
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Proof. Immediate from Corollary 3.10, because if x, y are the two 
points, then y = exp,(v) for some ve T,X, and the geodesic a such that 
a(t) = exp,(tv) joins the two points, is unique by the Hadamard—Cartan 
theorem, and has length ||v||,. 


Remark 1. The above corollary is of course mostly subsumed in the 
finite dimensional case by the Hopf—Rinow theorem, but it was noticed 
in the Hilbert case in McAlpin’s thesis [McA 65]. Furthermore, McAlpin 
observed that one can define on the ball B(—2/c) with c <0 of a Hilbert 
space E a bounded seminegative metric, as in the finite dimensional case, 
namely for x € B(—2/c) and v, we E we let 


4<v, w> 
COW Fy Px? 
Then the ball has curvature —c?. Note that for constant curvature K 
one has 

R(v, w)z = K(<z, w>v — Cz, vw). 


Similarly one shows that the sphere has constant positive curvature. 
Standard proofs that the only simply connected manifolds with constant 
curvature are all of E, a sphere of finite radius for positive curvature, and 
the above example for negative curvature, work in the Hilbert case, and 
will be given below. 


Remark 2. Proposition 3.2 can be interpreted as implying that the 
geodesics which come from rays starting at the origin in the tangent 
space are orthogonal to the g-spheres in X. Of course it may happen 
that the exponential map is not an injective map of T, into X (as on the 
circle or 2-sphere), so the orthogonality interpretation holds only when it 
makes sense. In the particular case of seminegative curvature and com- 
pleteness of the Cartan—Hadamard theorem, the interpretation is valid 
everywhere. Note that Proposition 3.2 in the case of seminegative curva- 
ture is also a special case of the “local” result on orthogonality, Theorem 
5.6 of Chapter VIII, because we have a global chart coming from the 
Cartan-Hadamard theorem, and the previous arguments are valid for this 
chart. 


We now work out as examples the cases of constant curvature. 


Theorem 3.12. Let X be Riemannian complete, simply connected. Let 
Xo EX. 


(a) If R=0, i.e. if X has 0 curvature, then the exponential map 


exp,,: T,X > X 
is an isometry. 
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(b) Suppose the curvature is constant, equal to —1. Let Y also 
be Riemannian complete, simply connected, and let y o¢Y. Let 
L: T,,X — T,,¥ be a linear isometry, and let f: X + Y be defined 
by 
f =exp,,°0 Lo exp;} 


so f is a differential isomorphism according to Theorem 3.8. Then 
f is an isometry. In other words, up to an isometry, there is only 
one complete Riemannian manifold with given constant negative cur- 
vature modeled on a given Hilbert space (finite dimensional or not). 


Proof. For (a), we use Theorem 3.1 and Proposition 2.10 which 
shows that the exponential map amounts to parallel translation, so is an 
isometry. For (b), we argue in a similar way, but a bit more complicated. 
We have to show that for each x € T,,X the map 


df (x): T,X > Ty) Y 


is a linear isometry. Since dexp, (0) = id, it follows that df(xo) = L, so 
df(xo) is a linear isometry. Assume x # Xo. Let x = exp,,(rv) with some 
unit vector ve T,,X and r>0. Let y denote the map which to we T,X 
associates the Jacobi lift 4, of Theorem 3.1. Then 


df(x) = dexp,,(L(rv)) 0 L 0 dexp,, (rv) 
= dexp,,(rL(v)) 0 L 0 dexp,, (rv)! 


1 L(v) 1 (v) fa 
= =r) 0 Lo —%(r)) 
r r 


The map dexp,,(rv): T,X —> T,X is a linear isomorphism. To show that 
df(x) preserves norms is equivalent to showing that 


\|df(x) o dexp,,(rv)w|| = ||dexp,,(rv)w|| for all we TX. 
But we have 
\|df(x) o dexp,,(rv)w|| = |\dexp, (rL(v))L(w)||_—s for all, we T,X. 


We may now use Proposition 2.11 and Theorem 3.1 which describe dexp 
in terms of its components in the v-direction and a direction orthogonal 
to v, and parallel translation. Since in Proposition 2.11 the respective 
coefficients 1 and (sinhr)/r are the same whether we take dexp,,(rv) 
or dexp,,(rL(v)) because L is an isometry, preserving orthogonality and 
changing unit vectors to unit vectors, it follows that in the notation of 
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Proposition 2.11, 


sinh r\? 
\|dexp,, (rv)w||? = cS + ci ) = I|\dexp,,(rL(v))L(w)||, 


thus proving (b), and concluding the proof of the theorem. 
We also have the following variation in the case of positive curvature. 


Theorem 3.13. Let X be Riemannian, complete, simply connected, with 
sectional curvature +1. Then X is isometric to the ordinary sphere of 
the same dimension in Hilbert space. 


Proof. The proof is similar, except that one cannot deal with the 
exponential defined on the whole tangent space T,,X. For convenience, 
we let X be the unit sphere in Hilbert space of a given dimension, and 
we let Y be Riemannian, complete simply connected with sectional curva- 
ture +1. We can then define the map f on the open ball of radius z. 
The same argument as before, replacing sinh r by sin, shows that f is a 
local isometry. We then pick another point x, 4 +X 9. We let 


df (x) = L,: T,X > Thx) Y- 


Just as we defined f = f,, from x», we can define f; = f,, from x,. Then 
f and f, coincide on the intersection of their domain, and thus define 
a local isometry X > Y. By Theorem 6.9 of Chapter VIII, this local 
isometry is a covering map, and since Y is assumed simply connected it 
follows that f is a differential isomorphism, and hence a global isometry, 
thus proving the theorem. 


Remark. The above theorems may be viewed as fitting a special case 
of a theorem of Cartan, cf. [BGM 71], Proposition E.IIL.2. 


IX, §4. THE INDEX FORM, VARIATIONS, AND 
THE SECOND VARIATION FORMULA 


We let (X,g) be a pseudo Riemannian manifold, with the corresponding 
covariant derivative D. As a matter of notation, if w is a vector in a 
tangent space, then we write w? = <w, wy, If w? > 0, then we define 


I|w|] = <w, wg”. 


We begin by a general discussion concerning the Jacobi expression 
defining Jacobi lifts. Let «:[a,b]— X be a geodesic parametrized by arc 
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length. Let ny € Lift(«). We are interested in the expression 

Den — R(a, nyo! 
and its square. By definition, 7 is a Jacobi lift if and only if this expres- 
sion is equal to 0, and in the Riemannian case, it is equal to 0 if and 
only if its square is equal to 0. We shall also deal with a subspace of 


Lift(a), namely: 


Lift,(«) = vector space of lifts 7 of « such that 
n(a) = 0 and n(b) = 0. 


For n, y € Lift(«) we define the index 


b 
I(n, 7) ms | [<Dyn, Dy Yq + R(o’, n, a’, y)J (s) ds. 


Then J is a symmetric bilinear form on Lift(«), whose corresponding 
quadratic form is 


b 
I(n, ) = | [(Dyn)” + R2(a’, n)](s) ds. 


Similarly we define 


Jacy(«) = subspace of Jacobi lifts of « lying in Lifto(a), that is, vanishing 


at the end points. 


Proposition 4.1. The index form I on Lift(«) also has the expression 


I(n, ~) = -|’ [<Din, yg — Rio’, n, a, y)1(s) ds 
+ <DyN, Yg(b) — (Dyn, Y4(@).- 
In particular, if n is a Jacobi lift, then 
I(n,n) = Dyn, 1dg(b) — Dan, 1>g(Q); 
and if in addition n € Jaco(a), then I(n, n) = 0. 
Proof. From the defining property of the metric derivative, we know 


that 
d<Dyn, Yq = <Dzn, Ya + Dyn, Dy Yq: 
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Then the first formula is clear. If in addition 7 is a Jacobi lift, then the 
expression under the integral is 0 by definition, so the second formula 
follows; and if 4 Jac (a) then the expressions belonging to the end 
points are equal to 0, so the proposition is proved. 


Theorem 4.2. Let 7 € Lift(a). Then I(n, y) =0 for all y € Liftg(«) if and 


only if 
(D2n — R(e', na’)? = 0. 


In the Riemannian case, this happens if and only if n is a Jacobi lift. 
Proof. If y is a Jacobi lift, then by definition 
Den = R(a, na’, 


so I(y, y) = 0 for all y € Lifty(a). Conversely, assume this is the case. Let 
o be a C® function on [a, b] such that g(a) = o(b) = 0. Let 


y= Din—Ro,n and y= oy. 


Then y € Lifty(a) and by Proposition 4.1, 


b 


0 =I(n, y) = | (s)y1(s)? ds. 


a 


This being true for all g as above, it follows that y? =0, whence the 
theorem follows. 

The previous discussion belongs to the general theory of the Jacobi 
differential equation. Previously, we developed this theory to get infor- 
mation about the differential of the exponential map. The differential 
equation has another side to it, to which we now turn. We shall be 
interested in two functions of paths a: [a,b] > X: 


The length function 


b 
L*(a) = L(x) = | \|a’(s)l,ds whenever a'(s)? = 0. 


The energy function 
b 
E°(a) = E(a) = | a’(s)? ds. 


Note that the length does not depend on the parametrization, but the 
energy does. We are interested in minimizing those functions. 
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In calculus, one applies the second derivative test at a critical point of 
a function, that is a point where the first derivative is 0. The second 
derivative then has geometric meaning. One wants to do a similar thing 
on function spaces, or the space of paths. Ultimately, one can define a 
manifold structure on this space, but there is a simple device which at 
first avoids defining such a structure for some specific computations. We 
are specifically interested here in the example of the second derivatives 


d? d? 
qe Ee) and ape Ee) 


at t=0. To compute these derivatives, we don’t need to give a differ- 
ential structure to the path space, we need only be able to differentiate 
under the integral sign in the usual way. The computation of these 
derivatives is called the second variation formula, and the end result is as 
follows, for the variation of a geodesic. Observe how the index form 
enters into the result. 


Theorem 4.3. Let «:[a,b]— X be a geodesic parametrized by arc length, 
that is a'(s)* =1 for all s. Let o be a variation of «, so that “,(s) = 
a(s, t). Define 


n(s) = 0, a(s, 0) and v(s) = D,(s) a <D,n(s), at’(s) >ga’(s), 


so v(s) is the normal projection of D,-n(s) with respect to the unit vector 
a’(s). Also define a second component along «'(s), namely 


Y2(s) = (D020, 0, o),(s, 0) = «D2 0,4(s, 0), a’(s)>,. 
Let R,(v, w) = R(v, w, v, w) be the canonical 2-tensor. Then 


es E(a,) 


i = I(n, n) + y2(b) — y2(a) 


t=0 


b 
= | [(Dyn)? + Rp(a’, n)](s) ds + y2(b) — y2(@). 


As for the length, assuming the variation satisfies «/(s)? => 0 for all t, s: 


d? 
Freee) 


| [v? + Rao’, 1166) ds + y2(6) — naa), 


so this is the same expression as for the energy, except that D,-n(s) is 
replaced by the normal projection v(s). 
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If the curves t +> o(a, t) and t +> a(b, t) are geodesics, then 
¥2(b) = 72(a) = 9, 
so the terms involving the end points are equal to 0. 


Remark. The last assertion is immediate, since for any geodesic y, we 
have D,,y’ = 0. 


Proof of Theorem 4.3. Let 
e(s, t) = (0,0, 4,0), (s, t) = a4(s)?. 


Note that e(s,0) = 1 by hypothesis. We shall compute the second deriva- 
tive of the length, which if anything is harder than that of the energy 
because of the square root sign. The computation for the energy follows 
exactly the same pattern. We shall keep in mind that from the defini- 
tions, 


D, n(s) = D,0,0(s, 0). 


We begin with the first derivative, also called the first variation, 
d ad fe ; 
ae == [2 
qe it [ e(s, t)’* ds 
by 
(1) = | els, t)-"4, e(s, t) ds. 


This formula gives us the first derivative of length, about which we shall 
say more later. We now forge ahead to the second derivative, which is 


2 b b 
FL) = | 4e(s, t)*02e(s, t) ds — | Le(s, t) 9(d,e(s, t))? ds 


whence 


b b 
= | 303e(s, 0) ds — | 1(A,e(s, 0)? ds 
0 


a a 


d2 
0) 52 Hee) 
= I, => I,. 


We then compute separately the expressions under the integral signs. 
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For the second integral, we have: 


d,€ = 0,014, 010), 
= 2¢D,0,6, 0,0), because D is the metric derivative 


= 2¢D, 0,6, 0,0), by Lemma 5.3 of Chapter VIII. 


Evaluating at 0, squaring, and integrating yields the term 


b 
(3) I, = | <D,n, # >,(s) ds. 


Next we compute the first integrand. We have: 


dze = 0,<D, 4,0, 01,0), 
= 2<D,D, 6,6, 06,0), + 2<D, 0,6, Dz 6,0),. 


In the first term on the right, we use Lemma 2.7 to write 
D,D, = D,D, = R(0,6, 0,0). 


In the second term on the right, we use Lemma 5.3 of Chapter VIII to 
write D,0, = D,0,. Then we find 


je = 2¢D, Dao, 0,0), — 2{R(d,6, 0,0)020, 6,5), + 2(D, 0,0) 
(4) = 2D, D306, 8:0) + 2Rp(d,0, 826) + 2(Dz0,0). 


Finally, we use the metric derivative again to compute: 
(5) 0,<D2 626, 0,0), = (D, D2 0,6, 00>, + (D026, D,d,0),. 


However, D,0,0(s, 0) = D,6,«(s) = D,-«'(s) = 0, because « is assumed to 
be a geodesic. Hence from (3) and (4) we find 


(6) d3e(s, 0) = 2y5(s) + 2R2(0, 6, 0,0)(s, 0) + 2(D,0,0(s, 0))?. 


Now if w, u are vectors and u is a unit vector, and v = w — (w-u)u is the 
orthogonalization of w with respect to u, then we have trivially 


v? = w? — (w-u)?. 


Subtracting (3) from (6) and integrating yields the asserted answer, and 
proves the formula for the second derivative of L(a,) at t = 0. 
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Remark 1. For simplicity we limited ourselves to curves rather than 
piecewise C? maps. Milnor [Mi 63] gives a thorough discussion of paths 
where end-point terms will appear where the path is broken. See for 
instance his Theorem 12.2 and Theorem 13.1 of Chapter II. 


Remark 2. Observe how R, comes naturally in the formula. At a 
minimum one wants the second derivative to be semipositive, so having 
all plus signs in the variation formula is desirable. 


Corollary 4.4. Let 4 be a Jacobi lift of «, and o a variation of a such 
that y(s) = 0,0(s, 0). Assume that t > o(a, t) and t+ o(b, t) are geode- 
sics. Then 
d? 
qe El) = Par (b)s (6) 9g — <Darn(a), (a) 2, 
t=0 


In particular, if D,.y is perpendicular to «' then this equality also holds 
if E is replaced by the length L. 


Proof. Immediate from Theorem 4.3 and the alternative expressions of 
Proposition 4.1. 


Concerning the orthogonality assumption which will recur, we recall 
that if a Jacobi lift 1 of « is such that D,.7 is orthogonal to a at some 
point, then D,.7 is orthogonal to «' on the whole interval of definition. 
See Proposition 2.4. 


Proposition 4.5. Assumptions being as in Theorem 4.3, suppose that D,- 
is orthogonal to «’. Then 


d 
qe) ee = 0. 


Proof. We consider the expression for the first derivative given at the 
beginning of the proof in formula (1), we note that 


02e(s, t) 2¢0,a(s, t), 0, o(s, t)>> 
whence 


d,e(s, 0) = 2<0,a(s, 0), 6,0(s, 0), = <a'(s), n(s)>,- 


By Proposition 2.4, a Jacobi lift which is perpendicular to «’ somewhre 
remains perpendicular to a’ all along, so 0,e(s,0) = 0, which proves the 
proposition. 
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For some applications, one wants to compute the second derivative of 
a composite function f(L(q,)), where f is a function of a real variable, for 
instance when we determine the Laplacian in polar coordinates later. So 
we give here the relevant formula, since it is essentially a corollary of the 
above considerations. 


Proposition 4.6. Let f be a C? function of a real variable. As in 
Theorem 4.3, let o be a variation of a, and let n(s) = 0,0(s, 0). Assume 
D,- orthogonal to «'. Then 


=) (L(o,)) a f'(L(o)) [<Dy-n(b), 1(b)>, — <Dyn(a), n(a) 41. 


t= 


Proof. Let F(t) = f(L(@,)). Then 


F() =f (Lay) 4 Le) 


and 


F=f" (Loy) 4 La) + FLaa)( 2) La 


Then at t = 0 the first term on the right is 0 because of Proposition 4.5. 
The second term at t=0 is the asserted one by Corollary 4.4 and the 
orthogonality assumption. This concludes the proof. 


Example. Proposition 3.3 provides an example of the situation in 
Proposition 4.6. Both will be used in Chapter X, §2. 


Theorem 4.3, i.e. the second variation formula, also has some topologi- 
cal applications which we don’t prove in this book, but which we just 
mention. If R, is negative, so the curvature is positive, then one has a 
theorem of Synge [Sy 36]: 


Let X be a compact even dimensional orientable Riemannian manifold 
with strictly positive sectional curvature. Then X is simply connected. 


The idea is that in each homotopy class one can find a geodesic of 
minimal length. By the second derivative test, the expression for the 
second derivative of the length is 0 for such a geodesic. One has to 
prove that one can choose the variation such that the “orthogonal” term 
containing the integral of v(s)? is 0. The boundary term will vanish if 
one works with a variation to which we can apply Remark 1. Finally, 
having strictly positive curvature will yield a negative term, which gives a 
contradiction. Details can be found in texts on Riemannian geometry. 

The same ideas and the theorem of Synge lead to a theorem of 
Weinstein [We 67]: 
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Let X be a compact oriented Riemannian manifold of positive sectional 

curvature. Let f be an isometry of X preserving the orientation if 

dim X is even, and reversing orientation if dim X is odd. Then f has a 

fixed point, i.e. a point x such that f(x) = x. 

Proofs of both the Synge and the Weinstein theorems are given in 
[doC 92]. The Synge theorem is given in [BGM 71] and [GHL 87[93]. 

This as far as we go in the direction of the calculus of variations. 
These are treated more completely in Morse theory, for instance in 
[Mi 63], [Pa 63], [Sm 64], and in differential geometry texts such as 
[KoN 69], [BGM 71], [ChE 75], [doC 92], [GHL 87/93]. 


Incidentally, I hope to have convinced the reader further about the 
irrelevance whether the manifold is finite dimensional or not. 


IX, §5. TAYLOR EXPANSIONS 


We shall deal systematically with the Taylor expansion of various curves. 
We consider a curve in X, say of class C?, not necessarily a geodesic, 


a: JX, 


and we assume 0&€J, so 0 is an origin. We suppose given a spray on 
TX, giving rise to the covariant derivative. For w € T,o)X we let 


yith y(t, w) 


be the unique a-parallel curve with initial condition y(0, w) = w. Recall 
that «-parallel means D,,y = 0. We denote parallel translation by 


PL = Pa = Boe Teo X 7 Ty X 


Then P, is a topological linear isomorphism, as we saw in Chapter VIII, 


§3. 
Proposition 5.1. Let 4: J+ TX be a lift of « in TX. Then 
m tk 
nt) =P, > Den +0(t™") — for t+0; 
k=0 : 


or alternatively, 


m k 
n(t)= ¥. y(t, Dén©))— + Of") for t 30. 
k=0 k! 
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Proof. The second expression is merely a reformulation of the first, 
taking into account the definition of parallel translation. Since t— 0, 
the formula is local, and we may prove it in a chart, so we use y, y to 
denote the vector components ny, yy in a chart U, suppressing the index 
U. Let 


m k 
BU) = nf) — ¥ oft, Din) jg. 
k=0 : 


From the existence and uniqueness of the ordinary Taylor formula, it will 
suffice to prove that for the ordinary derivatives of B, we have 


8*B(0) = B™(0)=0 = for -k=0,...,m. 


By definition, note that B(0)=0. Let w, = DEB(0). Since D,y =0, we 
have 


; : ted 
Di. B(t) = Din(t) — t, w,)———. 
Therefore 
Dz B(0) = Di-n(0) — y@, w,) 
SW Wy 
= 0. 


We now need a lemma. We let E be the Banach space on which X is 
modeled. 


Lemma 5.2. Let B: J —E be the vector component of a lift of a. If 
Di,B(0) = 0 for 0 <j <m then 0/B(0) = 0 for 0 <j <m. 


Proof. By definition, 
DB = B Bae Bia; a’, p). 
Hence D,,B(0) = f’(0). We can proceed by induction. Let us carry out 


the case of the second derivative so the reader sees what’s going on. 
Hence suppose in addition that D2B(0) = 0. From the definitions, we get 


Dz B = B" — (6, Bia; a, B)a’ + B(a; a”, B) + Bw; a, B’)) 
— B(a; a, B’ — Ba; a’, B)). 


Since (0) = f’(0) = D,-B(0) = 0 we find that 


0 = Dy BO) = B’O), 
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thus proving the assertion for m = 2. The inductive proof is the same in 
general. 


We apply the above considerations to Jacobi lifts. 


Proposition 5.3. Suppose that a is a geodesic. Let w € Tyo)X and let n, 
be the Jacobi lift of « such that n,(0) = 0 and D,-n,,(0) = w. Then 


3 
Nw(t) = pw + R(a'(0), w, «(0)) a + O(t*). 


Proof. We plug in Proposition 5.1. Since D2n,, = R(a’, n,,, «’) contains 
ny linearly, the evaluation of the second term of the Taylor expansion at 
0 is 0. As for the third term, we have to use the chain rule. To be sure 
we don’t forget anything, we should write more precisely 


R(a’, Ny, &) = RW; &', Ny, @’) 
to make explicit the dependence on the extra position variable. But it 
turns out that it does not matter in the end, because no matter what, the 
chain rule gives 
D3n, = R(a’, D,n,,, «) + terms containing y,, linearly, 

so D3n,,(0) = R(«’(0), w, «’(0)), which proves the proposition. 

From Proposition 5.3, we get information on the pull back of the 
metric g of a pseudo Riemannian manifold, to the tangent space at a 


given point. 


Proposition 5.4. Let (X,g) be a pseudo Riemannian manifold, and let 
x eX, Fix v, we T,X. Then 


exp*(tv)(g)(w, w) = w? + 4R,(v, wt? + O(t3) = fort 0. 
where we recall that R,(v, w) = R(v, w, v, w). 


Proof. From the theory of Jacobi lifts, applied to «(t) = exp,(tv), we 
have the formula 


1 
; n,(t) = dexp,(tv)w. 


Therefore modulo functions which are O(t*) for t>0, we get from 


260 CURVATURE [IX, §5] 
Proposition 5.3 


exp? (g)(tv)(w, w) = (—n,(0), inal) (a(t) 
g(a(t)) 


(P | w+ Re w, v) ab? | w+ Re w, A) 
g(a(t)) 


w+ R(v, w, of w+ R(v, w, 05) 
g(x) 


w? + 2R,(v, We 


which proves the proposition. 


The preceding proposition gives us the Taylor expansion of the map- 
ping 
S(v) = expF(g)(v) = for ve TX 


along rays through the origin. Observe that 
Sf: TX — L2.,,( FX) 


is a map of the self-dual Banach space T,X to the space of symmetric 
bilinear forms on T,X (actually the open subset of non-singular forms). 
The map f has a Taylor expansion 


(*) Fe) = £0) + Ai) + fav) + O(lv|?) for |v] +0, 


where v +> |v| is a Banach norm on T,, and where f, and f, are homoge- 
neous of degree 1 and 2 respectively. Since the homogeneous terms in 
the Taylor expansion are uniquely determined by f, and since we com- 
puted their restrictions to rays through the origin in Proposition 2.4, we 
now obtain: 


Theorem 5.5. Let (X,g) be a pseudo Riemannian manifold. Let x € X. 
For ve T,X let q(v) € L2,,(T,X) be the symmetric bilinear function such 
that 
1 
q(v)(w1, W2) = Rv, Wy, v, W2). 


3 
Let the metric g be viewed as a tensor in L2,m(TX), and let f be the 


pull back of the metric g in a star shaped neighborhood of 0, in T.X 
where the exponential map is defined. Then 


F(ve) = g(x) + qv) + O(\v|*) for |v] +0. 


CHAPTER X 


Volume Forms 


For the first time we meet a strictly finite dimensional phenomenon: If X 
is of finite dimension n, then the n-forms ./"(X) play a distinguished role 
whose extension to the infinite dimensional case is not evident. So this 
chapter is devoted to these forms of maximal degree. In the next chap- 
ter, we shall study how to integrate them, so the present chapter also 
provides a transition from the differential theory to the integration the- 
ory. 

Although for organization and reference purposes it is convenient to 
place together here a number of results on volume forms, only the first 
section giving a basic definition will be used in the next three chapters, 
so the other sections may be skipped by a reader wanting to get immedi- 
ately into integration. 


X, §1. THE RIEMANNIAN VOLUME FORM 


Let V be a finite dimensional vector space over R, of dimension n. We 
assume given a positive definite symmetric scalar product g, denoted by 


(v, w) <0, w>, = g(v, w) for v, we V. 


The space /\"V has dimension 1. If {e,,...,e,} and {u,,...,u,} are 
orthonormal bases of V, then 


Cy AT ANG = FU ATA Uy. 


Two such orthonormal bases are said to have the same orientation, or to 
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be orientation equivalent, if the plus sign occurs in the above relation. A 
choice of an equivalence class of orthonormal bases having the same 
orientation is defined to be an orientation of V. Thus an orientation 
determines a basis for the one-dimensional space /\"V over R. Such a 
basis will be called a volume. There exists a unique n-form Q on V 
(alternating), also denoted by vol,, such that for every oriented ortho- 
normal basis {e,,...,e,} we have 


Q(e,,...,e,) = 1. 


Conversely, given a non-zero n-form Q on V, all orthonormal bases 
{e,,...,€,} such that Q(e,,...,e,) > 0 are orientation equivalent, and on 
such bases Q has a constant value. 

Let (X,g) be a Riemannian manifold. By an orientation of (X,g) we 
mean a choice of a volume form Q, and an orientation of each tangent 
space T.X (x € X) such that for any oriented orthonormal basis {e,,...,¢,} 
of T,X we have 

(1, ---5€,) = 1. 


The form gives a coherent way of making the orientations at different 
points compatible. It is an exercise to show that if (X,g) has such an 
orientation, and X is connected, then (X, g) has exactly two orientations. 
In Chapter XI, we shall give a variation of this definition. By an oriented 
chart, with coordinates x,, ...,x, in R", we mean a chart such that with 
respect to these coordinates, the form has the representation 


Q(x) = p(x) dx, A+++ A dx, 


with a function @ which is positive at every point of the chart. We call 
Q the Riemannian volume form, and also denote it by vol,, so 


vol, (x) = Q(x) = Q,. 


We return to our vector space V, with positive definite metric g, and 
oriented. 


Proposition 1.1. Let Q = vol,. Then for all n-tuples of vectors {v,,...,v,} 
and {w,,...,w,} in V, we have 


Q(0,, .--,0,)Q(W,, ...,Wa) = det Cv;, w,>,- 


In particular, 
QW, ...,0,)? = detdv,, ,),. 


Proof. The determinant on the right side of the first formula is multi- 
linear and alternating in each n-tuple {v,,...,v,} and {w,,...,w,}. Hence 
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there exists a number c € R such that 
det ¢v;, wi>g = CQ(Y,, ..-,0,)Q(W1, --- Wa) 


for all such n-tuples. Evaluating on an oriented orthonormal basis shows 
that c = 1, thus proving the proposition. 


Applying Proposition 1.1 to an oriented Riemannian manifold yields: 


Proposition 1.2. Let (X,g) be an oriented Riemannian manifold. Let 
Q=vol,. For all vector fields {&1,...,E,} and {n,,....n,} on X, we 
have 

(S15. Sn)Q(N1, --- Mn) = det (di, mj )q- 
In particular, 


QE, eee = det <¢;, oe 


Furthermore, if €Y denotes the one-form dual to & (characterized by 
€’(n) = <é, n>, for all vector fields y), then 


QS 1, «EQ = CY Av A Gy. 


This last formula is merely an application of the definition of the wedge 
product of forms, taking into account the preceding formulas concerning 
the determinant. 

At a point, the space of n-forms is 1-dimensional. Hence any n-form 
on a Riemannian manifold can be written as a product gQ where @g is a 
function and Q is the Riemannian volume form. 

If € is a vector field, then Qo € is an (n — 1)-form, and so there exists 
a function @ such that 


d(Q 0 €) = @Q. 


We call g the divergence of € with respect to Q, or with respect to the 
Riemannian metric. We denote it by divg € or simply div é. Thus by 
definition, 

dQ o €) = (div €)Q. 


Example. Looking back at the example of Chapter V, §8 we see that 
if 
Q(x) = dx, A+++ A dx, 


is the canonical form on R" and € is a vector field, then its divergence is 
given by 
: n 06; 
dive € = Or 
i=1 Ox; 
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We shall study the divergence from a differential point of view in the 
next section, and from the point of view of Stokes’ theorem in Chapter 
XIII. 

On 1-forms, we define the operator 


d*: J'(X) > A(X) 


by duality, that is if AY denotes the vector field corresponding to A under 
the Riemannian metric, then we define 


d*j = —div i’. 
One defines the Laplacian or Laplace operator on functions by the formula 
A = d*d = —divo grad. 
Proposition 1.3. For functions 0, w we have 
A(py) = pA + pAg — 2<d@, dp),. 
Proof. The routine gives: 
A(py) = d*d(py) = d*(p dp + 9 dp) 
= —div(Wliy) — div(peay) 


—w div Caw _ (dW) Cap — @ div Cay = (do) Cay 
wAg + pA — 2<dg, dp, 


as was to be shown. 


Recall that 
(do, dip>, = grad ¢, grad Wa> 


so there is an alternative expression for the last term in the formula. 

More formulas concerning the Laplacian will be given in the next 
section, using the covariant derivative and the variation formula. For 
applications of such formulas and theory to the heat kernel, cf. [Cha 84], 
especially Chapters II and III, in addition to [BGM 71]. 


X, §2. COVARIANT DERIVATIVES 


In this section, we gather together a number of results on the covariant 
derivative in connection with volume forms on the oriented Riemannian 
manifold (X, g). 
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Proposition 2.1. Let D be the metric covariant derivative. Then 


for all vector fields €. 


Proof. We still write Q = vol,. Let €,,...,¢, be any vector fields. We 
take the covariant derivative of the relation 


(1) QE 1, SwQ= CL AA GY 


from Proposition 1.2. Let @ be the function g = Q(é,,...,é,). Then by 
relation (1), 


(2) D(9Q) = VEY Av A (DeE)Y Av A Gy 
= P(E, ..., Dee, ..5€,)Q 
after using Proposition 1.2 and COVD 3 of Chapter VIII, §1. But also 
(3) De = (DQ)(E 1, -+658n) + Y Er, sDeSis «En. 
Since D,(gQ) = (Dzg)Q + @D,Q, from (2) and (3) we obtain 
(4) (DeQ)(E,, ---56,)2 + PDQ = 0. 
We evaluate the expression on the left on (€,,...,¢,), and find 


QE 1 5 + 5Sn(DeQ)(E1, ---.8n) = 0. 


This being true for all choices of vector fields €,, ...,€,, the proposition 
is proved. 


Remark. The above theorem remains true suitably formulated in the 
non-oriented case, because the theorem is local, and locally, the absolute 
value of the form differs by +1 from the form itself. 


The next theorem will give an application of Theorem 2.1. 

The metric derivative D operates on vector fields and also on r-forms 
for all r, especially r= 1 and r=n. For any vector field € we let Dé be 
the endomorphism of [TX such that 


(DE)n = D,¢. 
At each point x e X we have the operator 


(D¢),: T.X > T,X such that (D¢),(v) = (D,¢)(), 
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on the finite dimensional vector space T,X. This allows us to take the 
trace tr(D¢) of this operator at each point, so to take tr(Dé),. The trace 
can be computed as usual by using an orthonormal basis. We recall that 
a finite sequence of vector fields is called an orthonormal frame (on some 
open subset of X) if they are orthonormal for the metric g, that is 


if i=j, 


1 
Reis Cp a 10 if iFxj. 


Given a point xe¢X, such an orthonormal frame always exists in a 
neighborhood of x. Similarly, we can define DA for a 1-form 2 € .1(X), 
whereby 
DA: TTX TTX is such that (DA)(¢) = D;A. 
Thus for each x € X, (DA), may be viewed as a linear map 
(Dd),: T.X > T,X, 
whose trace can be computed by using duality, namely 


tr(DA) = 9" <Dz,4, 6). 


On the right side, we use the convenient notation <4, €) = A(é) for a 
1-form 4 and a vector field €. In such a case, there is no subscript g on 
the scalar bilinear pairing between functionals and vectors. 


Theorem 2.2. Let €,, ...,6, be an orthonormal frame of vector fields, 
and let € be a vector field. Then 


div ¢ = 2 {Di Ss Gigs 
In particular, for 4 € of'(X) we have 
div 2¥ = tr(DA). 


Proof. Let Q= vol, be the volume form. By COVD 6 of Chapter 
VIII, §1, and Proposition 2.1, we get 


dQ. 0 C)(15 ++ 05Sn) = by (=I *De (Qo E) (Eq, 0.5655 «End 


= Vi (-1' 10 Dy (Eq, bjs ve eske) 
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and since D,.¢ has the Fourier expression D;,¢ = ¥ CDz,6, Fj >gSj5 
7 


= by (De, Eg Eas +s) 


But also d(Q o E)(E,,...,,) = (div €) vol,(¢,,...,¢,,). Hence 


div é =. = <Dz.6, Sys 


which proves the first formula. The second is a mere rephrasing, applied 
to the vector field 4”. 


Directly from the definition of the operator d* in the preceding sec- 
tion, we now obtain: 


Corollary 2.3. On a 1-form 4, we have d*A = —tr(DA). 
We can then apply this to the Laplacian, to get: 


Corollary 2.4. Let &,, ...,€, be an orthonormal frame as in Theorem 
2.2. Let o be a funtion. Then 


Ag = —tr(D do) = — ¥) (Dg, do, &:) = — Y <Dglerad 9), 2, 


If {u,,...,u,} is an orthonormal basis of the tangent space T,X at some 
point x € X, and a; is the geodesic with «;(0) = x and a(0) = u;, then 


do(e) = — 5. (9° 4)'0). 


Proof. The first assertion comes from applying Theorem 2.2 to A = dg. 
The second assertion then follows by using Corollary 4.4 of Chapter 
Vil. 


From the preceding corollary, we can obtain an expression for the 
Laplacian in polar coordinates. I follow [BGM 71]. We pick a point 
x é€X as an origin, with its tangent space T,X. We let U,, be an open ball 
centered at 0, on which exp, induces an isomorphism to its image, and we 
let ye U,. We want to determine Ag(y) for a function @ which depends 
only on the Riemannian distance from x, say 


o(y)=f(r(y)) where r(y) = dist,(x, y), 


and f is a C? function of a real variable. 
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Proposition 2.5. Let «=a, be the unique geodesic from x to y#x, 
parametrized by arc length, and let e, =a'(r)€ T,X. Let e2, ...,€, be 
unit vectors in T,X such that {e,,...,e,} is an orthonormal basis of 
T,X. Let n; (i = 2,...,n) be the Jacobi lift of « such that 


n{0) = 0 and nr) = e;. 
Then 


Ag(y) = -f"™) -f') xX <Dynilr), nilr)>g- 
Proof. Let B; (i = 1,...,n) be the geodesic from y such that 


B(O)=y and = B,(0) = ¢;. 


Observe that f,(t) = a,(r + t) for small t, by the uniqueness of the inte- 
gral curve of the corresponding differential equation. We apply Corollary 
2.4 to the Laplacian at y, and the geodesics f; (i= 1,...,n) to get 


Ag(y) = -¥ (9 B)"O. 


Since 8, (t) = a,(r + t), we can split off the first term, to obtain 
Ag(y) = —f"(") — x (po B;,)"(0). 


Let a; be the unique geodesic from x to B,(t) (for small t), parametrized 
by arc length. Thus «;,, is what we called the variation of a at its end 
point, in the direction of e;, for i= 2, ...,n. Then by Propositions 3.3 
and 4.6 of Chapter IX, and the fact that 


(po BO) = f(L@,.), 


we obtain 


(9 © Bi)"O) = f') <Denilr), nil) dg, 


which proves our proposition. 


X, §3. THE JACOBIAN DETERMINANT OF 
THE EXPONENTIAL MAP 


We continue to consider a Riemannian manifold (X,g). We let x eX, 
and we let B, be an open ball in T,X centered at 0,, such that exp, 
gives an isomorphism of B,, with its image in X. Thus without loss of 
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generality, we may assume X oriented, and we let vol, be the volume 
form on X. We call B, a normal chart at x. For ye€exp,(B,). We 
write y = exp,(v,), so v, = log,(y), as it were. 


We note that the differential 
dexp,(v,): T, > T, 


is a linear isomorphism, and both T, and T, have the positive definite 
scalar products of the Riemannian metric, so we may define the absolute 
value of the determinant of (dexp,)(v,). One simply picks orthonormal 
bases in each one of these vector spaces, an the determinant of the 
matrix representing (dexp,)(v,) with respect to these bases. Picking 
oriented bases actually makes the determinant positive, so we don’t need 
to take an absolute value. We let J denote the Jacobian determinant, so 


expt vol, = J vol.u. or also exp* vol,(v) = J(v) vol.,,(v), 


where vol,,, is the euclidean volume on T,X, determined by the positive 
definite metric g(x), and v is the vector variable in T,X. We shall express 
J in polar coordinates. 

Let S(1) be the unit sphere in T.X. Any vector ve T,X, v £0, can be 
written uniquely in the form 


v=ru, 


where u is a unit vector in the direction of v, and r>0. We call (r, u) 
the polar coordinates of v. Then the euclidean volume has the usual 
decomposition 


Vol. ye(r, u) = r) dr dp(u), 
where dy(u) is the usual spherical measure (d@ in dimension 2). Then 
(exp? vol,)(ru) = J(r, u)r"* dr du(u). 


Proposition 3.1. Let u be a unit vector in T,X and let a be the geodesic 
parametrized by arc length such that «(0) = x and «'(0)=u. Put u=w, 
and let {u,w2,...,w,} be a basis of T,X such that w,;lu for i=2, 
...n. Let n; (i= 2,...,n) be the Jacobi lift of « such that 


n (0) = 0 and D,n{0) = w;. 
Then 


a _ det(n,(r), ...,n(r)) _ det’? <n), n,(r)>, 
i ‘I(r, u) = det(w2, 2p Wy) 3 det(w,, 5Wn) 


The determinant on the right is taken for i, j = 2,...,n. 
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Proof. Observe that we may also use 4,, which is such that 7,(t) = 
ta’(t). The equality between the two expressions on the right of the 
equality sign follows from Proposition 1.1. Let f =exp,. Then for any 
vectors W,,...,W, € 1,X we have 


(exp* vol,)(v)(w1,....W,) = vol,(df(v)w,, ...,df(v)w,) 
= det(df(v)w,,...,df(v)w,) 
= J(v) det(w,,...,W,). 


We put v =rw, =ru. By Theorem 3.1 of Chapter IX we know that 


1 
dexp,(ru)w; = 7h). 


Then for i = 1, 4,(r)/r = ’(r), which is a unit vector perpendicular to the 
others. Thus to compute the volume of the parallelotope in euclidean 
n-space, we may disregard this vector, and simply compute the volume of 
the projection on (n— 1)-space, and thus we may compute only the 
(n — 1) x (n — 1) determinant of the vectors 


1 
det(2(r)/r, ...sta(r)/r) = 7a=i det(n2(r), ..-,malr)), 


from which the proposition falls out. 
Proposition 3.1 is applied in several cases. 


Corollary 3.2. If in Proposition 3.1 all the vectors w, are unit vectors 
u; such that {u,,...,u,} is an orthonormal basis of T,X, and u=u,, 
then we have simply 


r 1S (r, u) = det’?<n,(r), (7) 4. 


From this case and the asymptotic expansion for the Jacobi lifts, we 
obtain: 


Corollary 3.3. Again with an orthonormal basis {u,,...,u,} of T,X, let 
u =u, and 
Ric(u, u) = ¥) Rp(u, u;). 
i=2 
Then 


2 
exp* vol, (ru) = E + Ric(u, W) es 00°) VOleye(ru) for r+. 
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Proof. By Corollary 3.2, J(r, u) is det’?<n,(r)/r, n(r)/r>, with the deter- 
minant taken for i, j= 1, ...,n or i, j=2, ...,n. Using the asymptotic 
expansion of Chapter IX, Proposition 5.4 and the orthonormality, one 
gets that 


n 2\ 1/2 
Jr, u) = T] (1 + 2R,(u, “5 +0() for r0, 


i=2 
which is immediately expanded to yield the corollary. 


Example. Suppose dim X = 2. Then Ric(u, u) = R,(u, u,) = R2(u;, uy). 
Putting u, = u’, we get 


2 
I(r, u) = 1+ Ryu, uw) +0(?)  forr>0. 


If we keep u fixed, and use A in polar coordinates, A= —6? —r‘6,, 
then we see that 
R,(u, u’) = —3AJ(0). 


Compare with [He 78], Chapter I, Lemma 12.1 and Theorem 12.2. 


For the further asymptotic expansion of the volume, see [Gray 73], as 
well as applications referred to in the bibliography of this paper. 

On the other hand, we shall also meet a situation where {w,,...,w,} 
is not an orthonormal basis as in the next corollary. Cf. Chapter IX, 
Proposition 3.3. 


Corollary 3.4. Let exp,:B,— X be the normal chart in X as at the 
beginning of the section, and y = exp,(ru) with rue B,, and some unit 
vector u. Let a(s) = exp,(su) and let e, =«'(r). Complete e, to an 
orthonormal basis {e,,...,@,} of T,X, and let n; be the Jacobi lift of « 
(depending on y, or r if u, is viewed as fixed), such that 


n (0) = 0 and n(r) = e; for i= 2, ...,n. 


Let J’(s, u) = 0,J(s, u). Then 


= | n 
J [Jr w) + = Yi Dem) MO, 


Proof. In the present case, D,-y,(0) = w; is whatever it is, but we 
observe that the determinant det(w,,...,w,) is constant, so disappears in 
taking the logarithmic derivative of the expression in Proposition 3.1. 
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We also observe that in the present case, 


<nilr), nT), ot 5; 


so the matrix formed with these scalar products is the unit matrix. Tak- 
ing the logarithmic derivative of one side, we obtain 


J'/J(r, u) + (n — L/r. 


Let hj; = <nj,nj>,, and let H =(h;;). On the other side, we obtain the 
logarithmic derivative 

1 (det HY 

2 det H - 


Let H,, ...,H, be the columns of H. By Leibniz’s rule, we know that 
(det H) = > det(H,,...,Hj,...,H,)- 
i=2 


Observe that 
Mis 159% = Dar tis Nyy + Nis Daj 
and in particular, 
Nis Ny = 2¢Dy Nis Nirg- 


What we want follows from a purely algebraic property of determinants, 
namely: 


Lemma 3.5. Let A =(A',...,A™) be a non-singular m x m matrix over 
a field, where A‘, ...,A™ are the columns of A. Let B =(B',...,B™) be 
any m x m matrix over the field. Then 


¥, det(A’, ...,BY,...,A”) = (det A) tr(A*B). 


Proof. Let X = (x;;) be the matrix such that 
XA + ott + Xi A" = Bi for i=1,...,m. 
By Cramer’s rule, 
x,; det(A) = det(A!,...,B',...,A™). 


But AX = B so X = A™'B, and the lemma follows. 
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We apply the lemma to the case when A = H(r) is the unit matrix and 
B/ = H;(r) to conclude the proof. 


Corollary 3.6. Let @ be a C? function on a normal ball centered at the 
point x ¢X. Suppose that depends only on the g-distance r from x, 
say o(y) = f(r(y)). Let y = exp(ru), with a unit vector u. Then 


—1 
Ae) = Ff") — f'0 (su6. u) + 1), 


Proof. Combine Corollary 3.4 with Proposition 2.6. 


For further applications of Jacobi lifts to volumes, cf. for instance 
[GHL 87/93], Chapter 3H. 


X, §4. THE HODGE STAR ON FORMS 


We already touched on the star operation on functions, and we defined 
d* on 1-forms. We now deal systematically with the star operation on 
alternating forms. I shall follow Koszul’s formalism in formulas S 1 
through S 8 [Ko 57], which is quite elegant. A direct very brief treat- 
ment of just what is needed to get the global duality and adjointness of 
d, d* using Stokes’ theorem, will be done in a self-contained way ad hoc 
in Chapter XIII, so that the reader need not go through the systematic 
formalism just to understand that particular application of Stokes’ theorem. 

Until further notice, we don’t differentiate, and the theory is punctual, 
sO: 


We let T be a finite dimensional vector space over R, of dimension n, 
with r-forms , W in L1(T), and with vectors ve T. We suppose that T 
has a positive definite scalar product g, and is oriented so we have a 
volume form Q, =Q. We let v be the 1-form dual to v under g. 


S 1. There exists a unique isomorphism +*: Li(T)—> L"~"(T) such that 
for all v,, ...,0,-,€ T an 9 € L(T) we have 


(#Q) (01, ---50, JQ =O AVL A AD,. 


Proof. Given g, the right side of the above equation is a multilinear 
alternating function of v,, ...,v,-, into the 1-dimensional space of n- 
forms, so having chosen © as a basis for this space, we get a real-valued 
form, which constitutes the coefficient of Q on the left side. The associa- 
tion 

Pp t> * 
is obviously linear. 
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S 2. We have *Q =1 and *1 =Q, and for a function f, *(fQ) = f. 
Proof. Immediate from the definition S 1 and Proposition 1.1. 
S 3. For gp € L{(T) and v,, ...,v,-,€ T we have 
(#9) (015 --+sUn—r) = *(M A VY A 7 A Up-y)- 
Proof. Using S 2 and S 1, we find: 
(DP A DY Aw A Upp) = #L(#Q)(Oy Ao A U,-y)Q] 
= (#0) (V1, ..-,0,—-)(*Q) 
= (*@)(U1,..-,U_—,)- 


S 4. For 9 €L(T) and ve T we have 
*(9 Av’) =(*@) ov. 
Proof. Indeed, 


(«(p A OY Wbiss.a,tepey) = *(@ Avy A vy ACN Un—r-1) 
= (+@)(v, U1, sasleee ee) 
= ((*) ° v)(v, gree gVy—p-1 ‘ 


For the next property, we need a lemma independently of the star opera- 
tion. 


Lemma 4.1. For g € Li(T), and v, 01, ..-,0,—p41 € T, we have 


(pov) A VY AA Upp sd 
n-rt+1 a “N 
= > (—1TK0Y, 0, (@ A OY Av A OY Att A Og 141). 


i=1 


Proof. The basic formalism of forms tells us that the contraction with 
a vector is an anti-derivation on the algebra of forms (Chapter V, §5, 
CON 3). Since @ A vf A+: A vf_,4, has degree n + 1 and so is equal to 
0, we find 
O= (MA LY Av A OY p41) 00 


= (POV) ADL Av A Of rat 


n—r+1 
+ pS (-1 ti 0 A oY Av A (UY Ov) Av A OY ay 
é 
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We observe that 
bY ov = 0), 0, = <0, 0: = <v¥, >, 
to conclude the proof of the lemma. 
S 5. For any form o € L{(T) and ve T we have 
#(9 00) = (—1)"(#@) Av”. 
Proof. First, for all v,, ...,v,_,€ T we have 
(1) (#(@ © v))(O1, ---Pn-r41) = #(( 00) A DY Av A Oy p41): 
On the other hand, 


(—1)""1(«@) A vY =(—1)J*10Y A «@. 
Hence 


((- 1)"~1(*@) A vY )(v,, 1 Up—rti) 


= (—1)*(" A *0)(v,, siagaarti) 
n-r+1 


oa (—1)'** <0", v,>((#9) (01, oe bj, we yDn—p 41) 


i=1 


n—-r+1 ms 
DY HL KoY, (MA OF Av A OY Av A One): 
i=1 


Using Lemma 4.1 and (1) concludes the proof. 
We can do an induction on S 5, and also get a corollary: 
S 6. For 9 ELY(T), «* =(—1)~» and 
(eQ) A vY Av AY =(—1)" YQ 0 0, 0 +++ 0 8,). 
Proof. We have 


(#49) (01, -..50,) = #((#) A vY Av AY"), 


[applying S 5 repeatedly] = (—1)""~Y#(*(@ 0 v, o-*+ 0 »,)). 


Since for any function f we have «f = f*l and *«f = *fQ = f, property 
S 6 follows. 


S 7. Let S denote the * operation. Then S: Li(T)— L2~"(T) is an iso- 
morphism. 


This is immediate, but is stated for the record. 
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S 8. Let o, pe Li(T). Then 


Qaxep=Wn xq. 


Proof. The pairings of ~, y given by the expressions on the left and 
right are bilinear, so it suffices to verify the equality when 


Q=vL AAD, and w=wi{ ac aw,. 


i 


In this case, we obtain 


gp raxy =(-1" (ap) AG 


=(—1FMa(wy Av Aw) A OY Ao A OY 


[by S 6] = (1 P(- 1 Yal(wy A+ A w,’) 0 01 077+ 0 0, ] 
[by S 2] = det (w;, »;),2. 


io “j 


But det<w,, v,>, = det<v;, w;>,, from which S 8 follows. 


The next formula proves that the star operation is given in a simple- 
minded way on natural basis elements for the wedge products. We shall 
use this property in Chapter XIII, §4, in a self-contained way to make 
the results on integration independent of the general star formalism, but 
the next formula won’t be used in the rest of this section or the next. 


Proposition 4.2. Let {v,,...,v,} be an orthonormal basis of T. Let @,, 
...;0, be the dual basis of 1-forms. Let I = (i,,...,i,) with iy <-+** <i, 
and let J =(jy,..-.jn-p) with j, <-+*'<j,-, be the complementary set 
such that {1,...,n} is a permutation of (I, J). Let e(I, J) be the sign of 
the permutation. Assume v,,..., v, oriented. Let 

Oy; =O), N***A O;, and Oy =O; A AO; 


Jn-r* 


Then 
*@, = (I, J)wy. 


Proof. Directly from the definition of Q = Q, we have that 


At first, let J be an arbitrary sequence of n —r indices among (1,...,n). 
Then by S 3, 


(+@,)(v;, gee 50; ,) = *(@, A @,), 
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which is # 0 if and only if J is the complementary set, ie. (I,J) is a 
permutation of (1,...,n). In this case, the right side of the above expres- 
sion is simply e(I, J)#Q = e(I, J). Alternatively, one may write 

*o, = é(I, J)o,, 


if (I, J) is a permutaiton of (1, ...,n), from which Proposition 4.2 follows. 


We are now through with the punctual theory, and we pass to a Rieman- 
nian manifold (X,g), where the vector space T is replaced by the tangent 
bundle TX, and vectors are replaced by vector fields. We let D be the 
metric covariant derivative as usual. Also 


b"(X) =TL(TX). 


Proposition 4.3. The star operation commutes with every D,;, i.e. for any 
vector field € and yp € &"(X), we have 


+*D,9 = D.*@. 
Proof. For 0-forms (functions) and n-forms (functions times the vol- 


ume form) the assertion is immediate by using Proposition 2.1, to the 
effect that D. vol, = 0. Now let pe TL{(TX). Then: 


(Dz*@) (1, 8 S56 je) + > (*@)(C1; ria Dei; gacgene) 


= DA(*@)(E1,---5En-r)) [because D, is a derivation] 
=DxXep ancy asad) — [by S 3] 
= (*D.)(9 A Ef A+++ A Gy_,) [by the proposition for n-forms] 


= DO NEY NA Eva) + MON EL ADELA Ei) 
= (De OM Es -s-sbner) + Y, OME rs Debis bee) 


which proves the proposition. 
We now define d* in general to be 
d* =(—1)"*"*lade on of"(X). 
In Chapter XIII, §3 we shall define a scalar product on forms with 


compact support for which d* will be seen to be the adjoint of d. For 
the moment, we continue with an essentially differential algebraic theory. 
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Proposition 4.4. For o, yw € W(X) we have 


dp n*W=@ a (*d*p) + d(p a *). 


Proof. Immediate from the definition of d*, S 6, and the basic formula 
for d of a wedge product (a graded derivation). 


Proposition 4.5. Let €,, ...,¢, be a frame of vector fields, and let €, 


...56, be the dual frame, that is <&,¢)>,=6;;. Then for any form 
op € S"(X) we have 


d*p = > (Dz,9) © &. 


Proof. Proposition 1.1 of Chapter VIII gives us an expression for 
d(*@) in terms of the frame. The dual frame is such that 4, = €;. Then 
the formula of Proposition 4.4 is an immediate consequence of S 5. 


Remark. If the frame €,,...,6, is orthonormal, then of course €; = €;. 


We define the Laplacian associated with the Riemannian manifold 
(X, g) to be 


A = dd* + d*d, operating on each /"(X). 


On Euclidean space R" with its standard positive definite scalar product, 
the Laplacian on functions is the usual operator (with the minus sign) 


aeeE 


As a more general example illustrating the role of Ricci curvature, we 
give the one higher dimensional version of Corollary 2.4. Let 4 € /'(X). 
With the Ricci curvature in mind, we define Ric(A) to be the scalar 
valued form such that, with respect to an orthonormal frame ¢,, ...,¢,, 
and any vector field € we have 


Ric(A)(£) = )) <(DzDz, — Dz,Dz)A, &:): 


where we denote by <A, €> the value of a 1-form / on a vector field €. 


Proposition 4.6. Let €,, ...,€, be an orthonormal frame. As an opera- 
tor on 1-forms, A: of1(X) > 1(X) is given by 


A = —), D} - Ric. 
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Written in terms of the variables, this means 
<AA, o> = 2 (D;,Dz,4, 6) — ps <(D;Dz, — Dz,Dp)4, &>. 
Proof. By Proposition 4.5, we have 


d*) = —¥" (Dz,A)(E:) 


and so by a general formula on covariant derivatives we get a value for 
dd*A, namely 


<dd*i, €> = =) <D:Dz,A, §;>- 


On the other hand, to get d*d/, we first note that by COVD 6 of 
Chapter VIII, §1, 


Again by Proposition 4.5, 
<d*di, &> = ¥ <Dz,DzA, éi> ieee y (Dz, DzA, ¢>. 


Adding the two expressions yields the formula of the proposition. 


X, §5. HODGE DECOMPOSITION OF 
DIFFERENTIAL FORMS 


In this section we carry out a bit of pure algebra, applicable to the 
situation of the previous section, and also applicable to other situations, 
especially in the complex case. See for instance [Wel 80], pp. 147-148 
and [GriH 76], Chapter 0, §6. We work axiomatically. To prove the 
axioms H 1 and H 2 below requires more extensive analytical tools than 
we use in this book, and specifically it requires the basic theory of elliptic 
operators. What is needed is carried out in the above references, and the 
essential is done in a self-contained way in Appendix 4 of [La 75]. 

Since the algebraic set up which follows applies to other differential 
operators besides the d we have been using, I use a more neutral letter 
D, which in the complex theory is taken to be the so-called 6 operator. 

None of this section will be used in the rest of the book. It is 
included here only for the convenience of a reader wanting to see how 
the theory further develops, and to isolate clearly what is purely algebraic 
from what demands more differential analysis. 


280 VOLUME FORMS LX, §5] 


Let A be a vector space of dimension n over R, with a positive definite 
scalar product < , > and corresponding norm || ||; or alternatively, the 
vector space may be over C, with a positive definite hermitian product. 
Let 

D:A7A 
be a linear map which has an adjoint (algebraic) D*, that is 
<Du, v> = <u, D*v> all u, ve A. 
and such that DD = 0. We define the Laplacian of D to be 
Ap = A= DD* + D*D. 


We define Hp = H = ker A to be the D-harmonic space. We assume the 
Hodge Conditions: 


H 1. The kernel H = ker A is finite dimensional. 
H 2. We have H+ = AA. 


We then prove further properties as follows. 

Since H is assumed finite dimensional, there is an orthogonal projec- 
tion of A on H, which we denote also by H if necessary, that is H(u) is 
the orthogonal projection of u on H. 

Theorem 5.1. Under the above two Hodge conditions, we have 

H! = DA + D*A, 
and an orthogonal decomposition 
A=H1LAA=HLDA1D*<A. 
The restriction of A to H* is invertible, and 
Ker D=H + DA. 
Proof. By orthogonalization and H 2, given ue A we have 
u = Hu + Av = Hu + DD*v + D*Dv 


with some ve A. Hence A is contained in H + DA + D*A, so we get 
equality. Furthermore 


<Au, u> = ||Dull? + ||D*ull?. 
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Hence Au = 0 if and only if Du = D*u =0. (Each implication is immedi- 
ate.) The adjointness relation then shows that DA, D*A are orthogonal 
to H, and D* = 0 implies that DA is orthogonal to D*A, so we get the 
orthogonal decomposition 


A=H1DA1D*<A, 
and AA = DA + D*A by H 2. Since AH = 0 it follows that 
A: DA + D*A— DA + D*¥A 
is surjective, and so is an isomorphism, and thus A is invertible on H-. 


Finally H + DA is contained in the kernel of D, and D is injective on 
D*A because 


DD*u=0 => <DD*tuyuy=0 => | D*ul|? = 0. 
This proves the theorem. 


Remark 1. As a special case of the last formula, suppose ue A and u 
is perpendicular to Ker A. If u is D-closed, that is Du = 0, then u = Dv 
for some v € A, that is u is D-exact. 


Remark 2. If we denote by H(A) the homology ker D/Im D then we get 
an isomorphism of the homology with the harmonic space 


H ~ H(A). 
We let 
G:A4—>H't=AA 
be equal to 0 on H, and be the inverse of A on AA. Then by definition, 


GA = AG and I=AG+H. 
Furthermore: 


G and A commute with D and D*. 
Proof. We have 
AD = (DD* + D*D)D = DD*D and DA = D(DD* + D*D) = DD*D 


so D commutes with A. Similarly for D*. The commutation of D and 
D* with G then follows since G = A! on AA. 
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Graded structure 
Suppose in addition that A is graded, 
A= @ A’, 
p=0 
that A’ is orthogonal to A‘ for q # p, and that 
D? = D; AP > AP*} 
raises degrees by 1, so D*: A? > A?! lowers degrees by 1. 


Under the above assumptions, we can defined the homology of D in 
degree p to be 


H?(A) = Ker D?/Im D?™!, 


where D? is D viewed as map from A? to A?*!. Immediately from 
Theorem 5.1 we obtain: 


Theorem 5.2. Let H? = HH?(A). Then 


H=@” 


p=0 


and H?(A) = HP, that is every class in Ker D? mod Im D?™! has a unique 
representative in the harmonic space H?. 


The star operator 

We suppose given an automorphism S: A > A which is an isomorphism 
S: A? > A"?, 

We assume: 


S 1. On A? we have S? = (—1)P"~, 
S 2. D* =(—1)"*"*! on AP. 


Proposition 5.3. Under these assumptions, D = SD*S and H, A, G com- 
mute with S. 
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Proof. We give the proof when n is even for simplicity. For ue A?, 
we have: 


SD*Su = —S?DS?u = —S?D(—1)’u 
= —(-1)(-1" "Du 
= Du, 


so D = SD*S. 
For the commutation of S with A, we write, using the above, 


SA = —SDSDS — SSDSD, 
AS = —DSDSS — SDSDS. 
On A’, SS = (—1)?’, so it is immediate that SS commutes with DSD, thus 
showing that S commutes with A. 
Since S commutes with A, it follows that 
S:H-H 


induces an automorphism of H with itself. For ue A we have: 


Su — HSu eH by definition of the orthogonal projection; and 
Su — SHu = SAGu = ASGu since A commutes with S. 


Then 
Su — SHu LH since it lies in AA. 
Subtracting shows that HSu — SHu is both orthogonal to H, and also 


lies in H, so must be 0, whence H commutes with S. Since G=A™ on 
H! it follows that G also commutes with S, thus proving the proposition. 


CHAPTER XI 


Integration of Differential 
Forms 


The material of this chapter is also contained in my book on real 
analysis [La 93], but it may be useful to the reader to have it also here 
in a rather self contained way, based only on standard properties of 
integration in Euclidean space. 


Throughout this chapter, is Lebesgue measure on R". 
If A is a subset of R", we write #1(A) instead of L(A, p, C). 
All manifolds are assumed finite dimensional. 
They may have a boundary. 
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We recall that a set has measure 0 in R" if and only if, given «¢, there 
exists a covering of the set by a sequence of rectangles {R,} such that 
y' u(R;) < ¢. We denote by R; the closed rectangles, and we may always 
assume that the interiors R} cover the set, at the cost of increasing the 
lengths of the sides of our rectangles very slightly (an ¢/2" argument). We 
shall prove here some criteria for a set to have measure 0. We leave it to 
the reader to verify that instead of rectangles, we could have used cubes 
in our characterization of a set of a measure 0 (a cube being a rectangle 
all of whose sides have the same length). 

We recall that a map f satisfies a Lipschitz condition on a set A if 
there exists a number C such that 


f(x) — f(y)| S$ Clx — yl 
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for all x, ye A. Any C! map f satisfies locally at each point a Lipschitz 
condition, because its derivative is bounded in a neighborhood of each 
point, and we can then use the mean value estimate, 


f(x) — OM S 1x — yl sup |f'(I, 


the sup being taken for z on the segment between x and y. We can take 
the neighborhood of the point to be a ball, say, so that the segment 
between any two points is contained in the neighborhood. 


Lemma 1.1. Let A have measure 0 in R" and let f: AR" satisfy a 
Lipschitz condition. Then f(A) has measure 0. 


Proof. Let C be a Lipschitz constant for f. Let {R,} be a sequence of 
cubes covering A such that )\y(R,)<e. Let 1, be the length of the side 
of R;. Then for each j we see that f(A 7S,) is contained in a cube R; 
whose sides have length < 2Cr;. Hence 


H(Rj) S 2"C"r? = 2"C"p(R;). 
Our lemma follows. 


Lemma 1.2. Let U be open in R" and let f: U +R" be a C! map. Let 
Z be a set of measure 0 in U. Then f(Z) has measure 0. 


Proof. For each xe U there exists a rectangle R, contained in U such 
that the family {R2} of interiors covers Z. Since U is separable, there 
exists a denumerable subfamily covering Z, say {Rj}. It suffices to prove 
that f(ZR,) has measure 0 for each j. But f satisfies a Lipschitz 
condition on R; since R; is compact and f’ is bounded on R,, being 
continuous. Our lemma follows from Lemma 1.1. 


Lemma 1.3. Let A be a subset of R™. Assume that m <n. Let 

f:A7>R’ 

satisfy a Lipschitz condition. Then f(A) has measure 0. 

Proof. We view R™ as embedded in R" on the space of the first m co- 
ordinates. Then R” has measure 0 in R", so that A has also n-dimen- 
sional measure 0. Lemma 1.3 is therefore a consequence of Lemma 1.1. 

Note. All three lemmas may be viewed as stating that certain para- 


metrized sets have measure 0. Lemma 1.3 shows that parametrizing a set 
by strictly lower dimensional spaces always yields an image having mea- 
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sure 0. The other two lemmas deal with a map from one space into 
another of the same dimension. Observe that Lemma 1.3 would be false 
if f is only assumed to be continuous (Peano curves). 


The next theorem will be used later only in the proof of the residue 
theorem, but it is worthwhile inserting it at this point. 

Let f: X > Y be a morphism of class C?, with p21, and assume 
throughout this section that X, Y are finite dimensional. A point x eX 
is called a critical point of f if f is not a submersion at x. This means 
that 

Tf: T,X > Tyoy¥ 


is not surjective, according to our differrential criterion for a submersion. 

Assume that a manifold X has a countable base for its charts. Then 
we can say that a set has measure 0 in X if its intersection with each 
chart has measure 0. 


Theorem 1.4 (Sard’s Theorem). Let f: X > Y be a C®” morphism of 
finite dimensional manifolds having a countable base. Let Z be the set 
of critical points of f in X. Then f(Z) has measure 0 in Y. 


Proof. (Due to Dieudonné.) By induction on the dimension n of X. 
The assertion is trivial if n=0. Assume n= 1. It will suffice to prove 
the theorem locally in the neighborhood of a point in Z. We may 
assume that X = U is open in R" and 


f:U—>R? 
can be expressed in terms of coordinate functions, 
i = (fi, stele 


We let us usual 
D* = D31--- Di 


be a differential operator, and call |a|=a,+---+ 4, its order. We let 
Zo = Z and for m2 1 we let Z,, be the set of points x € Z such that 


D*f(x) = 0 
for all j = 1, ...,p and all « with 1 <|«|<m. We shall prove: 


(1) For each m= 0 the set f(Z,, — Zin+1) has measure 0. 
(2) If m=n/p, then f(Z,,) has measure 0. 


This will obviously prove Sard’s theorem. 
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Proof of 1. Let ae Z,,—Z,+,. Suppose first that m=0. Then for 
some coordinate function, say j= 1, and after a renumbering of the 
variables if necessary, we have 


D, fi (a) # 0. 
The map 


g: xH>(f, (x), X2; sagas) 


obviously has an invertible derivative at x =a, and hence is a local 
isomorphism at a. Considering fog instead of f, we are reduced to 
the case where f is given by 


F(x) = (x1, fol), : f(x) = (x1, h(x)), 


where h is the projection of f on the last p—1 coordinates and is 
therefore a morphism h: V > R?~! defined on some open V containing a. 


Then 
1 0 
Aare: ( Bie 


From this it is clear that x is a critical point for f if and only if x is a 
critical point for h, and it follows that h(ZV) has measure 0 in R??. 
Since f(Z) is contained in R} x h(Z), we conclude that f(Z) has measure 
0 in R? as desired. 
Next suppose that m2 1. Then for some a with |a| = m+ 1, and say 
j= 1, we have 
D*f,(a) #0. 


Again after a renumbering of the indices, we may write 
D*f, = Dig, 


for some function g,, and we observe that g,(x) =0 for all xe Z,,, in a 
neighborhood of a. The map 


g:xXt> (g(x), X25 eet) 


is then a local isomorphism at a, say on an open set V containing a, and 
we see that 
9(Zm OV) < {0} x R™. 


We view g as a change of charts, and considering fog! instead of f, 
together with the invariance of critical points under changes of charts, we 
may view f as defined on an open subset of R" 1. We can then apply 
induction again to conclude the proof of our first assertion. 
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Proof of 2. Again we work locally, and we may view f as defined on 
the closed n-cube of radius r centered at some point a. We denote this 
cube by C,(a). For m 2 n/p, it will suffice to prove that 


f(Zn > C,(a)) 


has measure 0. For large N, we cut up each side of the cube into N 
equal segments, thus obtaining a decomposition of the cube into N” 
small cubes. By Taylor’s formula, if a small cube contains a critical point 
xeéZ,,, then for any point y of this small cube we have 


f(y) — f@)| S K|x — yl" S$ K(2r/Ny"™, 


where K is a bound for the derivatives of f up to order m+ 1, and we 
use the sup norm. Hence the image of Z,, contained in small cube is 
itself contained in a cube whose radius is given by the right-hand side, 
and whose volume in R?” is therefore bounded by 


K?(2r/N)Pemt)), 
We have at most N” such images to consider and we therefore see that 
f(Zn 0 C,(@)) 


is contained in a union of cubes in R?, the sum of whose volumes is 
bounded by 


K?N"(2r/N)Pmry) < KP(2rypent) Na Pim +1), 

Since m= n/p, we see that the right-hand side of this estimate behaves 
like 1/N as N becomes large, and hence that the union of the cubes in 
R? has arbitrarily small measure, thereby proving Sard’s theorem. 

Sard’s theorem is harder to prove in the case f is C? with finite p 
[29], but p = oo already is quite useful. 
Xl, §2. CHANGE OF VARIABLES FORMULA 
We first deal with the simplest of cases. We consider vectors v,, ...,v,, in 
R" and we define the block B spanned by these vectors to be the set of 
points 


t,v, + saree, + 0,0, 


with 0 <t;< 1. We say that the block is degenerate (in R") if the vectors 
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V,, .--,U, are linearly dependent. Otherwise, we say that the block is 
non-degenerate, or is a proper block in R’. 


We see that a block in R? is nothing but a parallelogram, and a block in 
R? is nothing but a parallelepiped (when not degenerate). 
We shall sometimes use the word volume instead of measure when 

applied to blocks or their images under maps, for the sake of geometry. 
We denote by Vol(v,,...,v,) the volume of the block B spanned by 
U1, --.,0,- We define the oriented volume 


Vol®(v,,...,0,) = +Vol(v,,...50,)5 
taking the + sign if Det(v,,...,v,) > 0 and the — sign if 
Det(v,,...,v,) < 0. 


The determinant is viewed as the determinant of the matrix whose col- 
umn vectors are v,, ...,v,, in that order. 

We recall the following characterization of determinants. Suppose that 
we have a product 


(V1, ---,U,) OD, ADZ ATO AD, 


which to each n-tuple of vectors associates a number, such that the 
product is multilinear, alternating, and such that 


eA Ae, = 1 


if e,, ...,e, are the unit vectors. Then this product is necessarily the 
determinant, that is, it is uniquely determined. “Alternating” means that 
if v; = v; for some i # j, then 


v, A Av, = 9. 


The uniqueness is easily proved, and we recall this short proof. We can 
write 


v; = Qj, ey + aia Ainen 
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for suitable numbers a,,, and then 


Dy ANA Dy = (Ay ey Hot + Ay pOg) Att A (Qny ey + °°* + Ann €n) 


=) 91 ,0(1)€o(1) A" A An, anyon) 
o 


= > 41,01)" An,omyPa(t) A °° A Cony: 
o 


The sum is taken over all maps a: {1,...,n} > {1,...,n}, but because of 
the alternating property, whenever o is not a permutation the term corre- 
sponding to o is equal to 0. Hence the sum may be taken only over all 
permutations. Since 


Cg) N'A Cain) = (ae, Av** Ae, 


where e(c) = 1 or —1 os a sign depending only on 4a, it follows that the 
alternating product is completely determined by its value e, A--: A e,, 
and in particular is the determinant if this value is equal to 1. 


Proposition 2.1. We have 


Vol®(v,, ...,0,) = Det(v,, ...,0,) 
and 
vol(v,,...,0,) = |Det(v,, ...,v,)|. 


Proof. If v,, ...,v, are linearly dependent,then the determinant is equal 
to 0, and the volume is also equal to 0, for instance by Lemma 1.3. So 
our formula holds in the case. It is clear that 


Vol%(e,,...,€,) = 1. 


To show that Vol® satisfies the characteristic properties of the determi- 
nant, all we have to do now is to show that it is linear in each variable, 
say the first. In other words, we must prove 


(*) Vol°(cv, 02, ...,0,) = ¢ Vol?(v, v2, ...,0,) for ceR, 


(**) Vol°(v + w, v2, ...,0,) = Vol(v, v2, ...,0,) + Vol(w, v2, ...,0,). 


As to the first assertion, suppose first that c is some positive integer k. 
Let B be the block spanned by », v2, ...,v,. We may assume without 
loss of generality that v, v2, ...,v, are linearly independent (otherwise, the 
relation is obviously true, both sides being equal to 0). We verify at once 
from the definition that if B(v, v,,...,v,) denotes the block spanned by », 
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V2, ...,0, then B(kv, v2,...,v,) is the union of the two sets 
B((k — 1)v, v2,...,0,) and B(v, v2,...,v,) + (k — 1)v 


which have only a set of measure 0 in common, as one verifies at once 
from the definitions. 


Therefore, we find that 
Vol(kv, v2, ...,0,) = Vol((k — 1)v, v2, ...,0,) + Vol(v, v2, ...,0,) 
= (k — 1) Vol(v, v2, ...,0,) + Vol(v, v2, ...,0,) 
=k Vol(v, v2, ...,0,) 
as was to be shown. 
Now let 
v =0,/k 


for a positive integer k. Then applying what we have just proved shows 
that 


1 1 
vol( Fo. v2, art) = k Vol(,, ---50,)- 


Writing a positive rational number in the form m/k = m-1/k, we conclude 
that the first relation holds when c is a positive rational number. If r is 
a positive real number, we find positive rational numbers c, c’ such that 
c<rsxc’. Since 

B(cv, v2, ...,0,) < B(rv, v2, ..-50,)  B(c’v, v2, .--50_), 


we conclude that 


c Vol(v, 02, ...50,) S Vol(rv, v2, ...,0,) Sc’ Vol(v, v2, ...,0,). 
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Letting c, c’ approach r as a limit, we conclude that for any real number 
r = 0 we have 


Vol(rv, v2, ...,0,) =r Vol(v, v2, ...,0,). 
Finally, we note that B(—v, v,,...,v,) is the translation of 
Biv, v2, ..-5Up) 


by —v so that these two blocks have the same volume. This proves the 
first assertion. 

As for the second, we look at the geometry of the situation, which is 
made clear by the following picture in case v = v1, w= 0p. 


The block spanned by v,, v2, ... consists of two “triangles” T, T’ having 
only a set of measure zero in common. The block spanned by v, + v, 
and v, consists of T’ and the translation T+ v,. It follows that these 
two blocks have the same volume. We conclude that for any number c, 


Vol? (v, + C02, 02, ..+50_) = VOl?(v,, 02, .-.50,)- 
Indeed, if c = 0 this is obvious, and if c 4 0 then 


c Vol®(v, + cv, v2) = Vol®(v, + cv2, cv2) 
= Vol°(v, + cv2) = c Vol*(v,, v2). 
We can then cancel c to get our conclusion. 
To prove the linearity of Vol® with respect to its first variable, we may 
assume that v,, ...,v, are linearly independent, otherwise both sides of 


(*«) are equal to 0. Let v, be so chosen that {v,,...,v,} is a basis of R”. 
Then by induction, and what has been proved above, 


Vol?(cp 0, + 77° + CyU ps 25 +++5Un) 
= Vol?(cy 0, + 55+ + Cy—1Up—1 > Vas +++ Vn) 
= Vol9(c,01, 02, ..-50,) 
= C, Vol(v,,...,0,)- 


From this the linearity follows at once, and the theorem is proved. 
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Corollary 2.2. Let S be the unit cube spanned by the unit vectors in R". 
Let 1: R" > R" be a linear map. Then 


Vol A(S) = |Det(A)]. 
Proof. If v,, ...,v, are the images of e,, ...,e, under A, then A(S) is the 
block spanned by v,, ...,v,. If we represent 1 by the matrix A = (a,), 


then 
Ui = Aye, T° + Aniey, 


and hence Det(v,,...,v,) = Det(A) = Det(A). This proves the corollary. 
Corollary 2.3. If R is any rectangle in R" and 1: R"—R" is a linear 
map, then 


Vol A(R) = |Det(A)| Vol(R). 


Proof. After a translation, we can assume that the rectangle is a 
block. If R = 1,(S) where S is the unit cube, then 


A(R) = Ao A,(S), 
whence by Corollary 2.2, 
Vol A(R) = |Det(A o 4,)| = |Det(A) Det(A,)| = |Det(2)| Vol(R). 

The next theorem extends Corollary 2.3 to the more general case 
where the linear map / is replaced by an arbitrary C‘-invertible map. 
The proof then consists of replacing the linear map by its derivative and 
estimating the error thus introduced. For this purpose, we have the 
Jacobian determinant 


A,(x) = Det J,(x) = Det f’(x), 


where J,(x) is the Jacobian matrix, and f’(x) is the derivative of the map 
f:U->R". 


Proposition 2.4. Let R be a rectangle in R", contained in some open set 
U. Let f: UR" be a C' map, which is C'-invertible on U. Then 


u(f(R)) = I, JAy| du. 


Proof. When f is linear, this is nothing but Corollary 2.3 of the 
preceding theorem. We shall prove the general case by approximating f 
by its derivative. Let us first assume that R is a cube for simplicity. 
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Given ¢, let P be a partition of R, obtained by dividing each side of R 
into N equal segments for large N. Then R is partitioned into N" 
subcubes which we denote by S; (j =1,...,N"). We let a; be the center 
of S;. 

We have 


Vol f(R) = 5° Vol f(S;) 


because the images f(S,) have only sets of measure 0 in common. We 
investigate f(S;) for each j. The derivative f’ is uniformly continuous on 
R. Given ¢, we assume that N has been taken so large that for x € S; we 
have 


F(x) = f(G) + A(x — aj) + O(x — a;), 
where 4; = f’(a;) and 
lp(x — a;)| S|x— a; 


To determine Vol f(S;) we must therefore investigate f(S) where S is a 
cube centered at the origin, and f has the form 


f(x) = Ax + P(x), |e) S |xIe. 


on the cube S. (We have made suitable translations which don’t affect 
volumes.) We have 


A* 0 f(x) =x +4 0 Ox), 
so that 47! o f is nearly the identity map. For some constant C, we have 
for xeS 


|A~* 0 g(x) S Ce. 


From the lemma after the proof of the inverse mapping theorem, we 
conclude that 17! o f(S) contains a cube of radius 


(1 — Ce)(radius S), 
and trivial estimates show that A7! o f(S) is contained in a cube of radius 


(1 + Ce)(radius S). 


We apply 4 to these cubes, and determine their volumes. Putting indices 
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j on everything, we find that 


|Det f’(a,)| Vol(S;) — eC, Vol(S)) 
< Vol f(S,) < [Det f"(a))| Vol(S;) + eC, Vol(S,) 


with some fixed constant C,. Summing over j and estimating |A,|, we 
see that our theorem follows at once. 


Remark. We assumed for simplicity that R was a cube. Actually, by 
changing the norm on each side, multiplying by a suitable constant, and 
taking the sup of the adjusting norms, we see that this involves no loss 
of generality. Alternatively, we can approximate a given rectangle by 
cubes. 


Corollary 2.5. If g is continuous on f(R), then 


| gdp = | (g°f)|Ay| dy. 
F(R) R 


Proof. The functions g and (go f)|A,| are uniformly continuous on 
f(R) and R respectively. Let us take a partition of R and let {S;} be the 
subrectangles of this partition. If 6 is the maximum length of the sides of 
the subrectangles of the partition, then f(S;) is contained in a rectangle 
whose sides have length < Cod for some constant C. We have 


| adu=5 | g du. 
S(R) J J f(Sj) 


The sup and inf of g of f(S;) differ only by « if 6 is taken sufficiently 
small. Using the theorem, applied to each S;, and replacing g by its 
minimum m, and maximum M, on S,, we see that the corollary follows at 
once. 


Theorem 2.6 (Change of Variables Formula). Let U be open in R" and 
let f: UR" be a C! map, which is C' invertible on U. Let g be in 
L"(f(U)). Then (go f)|Ay,| is in '(U) and we have 


| g du = | (9° f)|As| du. 
F(U) U 


Proof. Let R be a closed rectangle contained in U. We shall first 
prove that the restriction of (go f)|A,| to R is in #*(R), and that the 
formula holds when U is replaced by R. We know that C,(f(U)) is 
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L*-dense in #1(f(U)), by [La 93], Theorem 3.1 of Chapter IX. Hence 
there exists a sequence {g,} in C,(f(U)) which in L'-convergent to g. 
Using [La 93], Theorem 5.2 of Chapter VI, we may assume that {g,} 
converges pointwise to g except on a set Z of measure 0 in f(U). By 
Lemma 1.2, we know that f~1(Z) has measure 0. 

Let gf =(9,° f)|A,|. Each function g¥ is continuous on R. The se- 
quence {g¥} converges almost everywhere to (go f )|A;| restricted to R. 
It is in fact an L'-Cauchy sequence in #1(R). To see this, we have by 
the result for rectangles and continuous functions (corollary of the pre- 
ceding theorem): 


| lgt — gml dp = | lDx — Im| 4p, 
R S(R) 


so the Cauchy nature of the sequence {g¥} is clear from that of {g,}. It 
follows that the restriction of (go f )|A,| to R is the L'-limit of {g¥}, and 
is in #'(R). It also follows that the formula of the theorem holds for R, 
that is 


| gdu= | (g° f)|A,r| du 
S(A) A 


when A = R. 

The theorem is now seen to hold for any measurable subset A of R, 
since f(A) is measurable, and since a function g in £1(f(A)) can be 
extended to a function in #1(f(R)) by giving it the value 0 outside F(A). 
From this it follows that the theorem holds if A is a finite union of 
rectangles contained in U. We can find a sequence of rectangles {R,,} 
contained in U whose union is equal to U, because U is separable. 
Taking the usual stepwise complementation, we can find a disjoint se- 
quence of measurable sets 


An = Rm — (Ry U*** U Rat) 


whose union is U, and such that our theorem holds if A = A,,. Let 


hn = Df(Am) = IXf(Am) and hy = (hp © S)IAsI- 


Then )'h,, converges to g and )'h* converges to (go f )|A;|. Our theo- 
rem follows from Corollary 5.13 of the dominated convergence theorem 
in [La 93]. 


Note. In dealing with polar coordinates or the like, one sometimes 
meets a map f which is invertible except on a set of measure 0, e.g. the 
polar coordinate map. It is now trivial to recover a result covering this 
type of situation. 
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Corollary 2.7. Let U be open in R" and let f: UR" be a C' map. 
Let A be a measurable subset of U such that the boundary of A has 
measure 0, and such that f is C! invertible on the interior of A. Let g 
be in £1(f(A)). Then (go f)|A,| is in Y*(A) and 


| aiu= | (go f)|A,| du. 
S(A) A 


Proof. Let U, be the interior of A. The sets f(A) and f(U,) differ 
only by a set of measure 0, namely f(@A). Also the sets A, Up differ only 
by a set of measure 0. Consequently we can replace the domains of 
integration f(A) and A by f(U)) and Up, respectively. The theorem 
applies to conclude the proof of the corollary. 


XI, §3. ORIENTATION 


Let U, V be open sets in half spaces of R" and let g: U-V be a 
C! isomorphism. We shall say that is orientation preserving if the 
Jacobian determinant A,(x) is > 0, all xeU. If the Jacobian determi- 
nant is negative, then we say that @ is orientation reversing. 

Let X be a C? manifold, p = 1, and let {(U;, g,)} be an atlas. We say 
that this atlas is oriented if all transition maps g,° g;' are orientation 
preserving. Two atlases {(U;,,,)} and {(V,,W,)} are said to define the 
same orientation, or to be orientation equivalent, if their union is oriented. 
We can also define locally a chart (V, w) to be orientation compatible with 
the oriented atlas {(U;, @,)} if all transition maps g, 0 g' (defined when- 
ever U,V is not empty) are orientation preserving. An orientation 
equivalence class of oriented atlases is said to define an oriented mani- 
fold, or to be an orientation of the manifold. It is a simple exercise to 
verify that if a connected manifold has an orientation, then it has two 
distinct orientations. 

The standard examples of the Moebius strip or projective plane show 
that not all manifolds admit orientations. We shall now see that the 
boundary of an oriented manifold with boundary can be given a natural 
orientation. 


Let @: UR" be an oriented chart at a boundary point of X, such 
that: 


(1) if (x,,...,x,) are the local coordinates of the chart, then the 
boundary points correspond to those points in R" satisfying x, = 0; 
and 

(2) the points of U not in the boundary have coordinates satisfying 
x, <0. 
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Then (x,,...,x,) are the local coordinates for a chart of the boundary, 
namely the restriction of g to 0X MU, and the picture is as follows. 


(x » Xn) 


We may say that we have considered a chart @ such that the manifold 
lies to the left of its boundary. If the reader thinks of a domain in R?, 
having a smooth curve for its boundary, as on the following picture, the 
reader will see that our choice of chart corresponds to what is usually 
visualized as “counterclockwise” orientation. 


The collection of all pairs (UA 6X, e|\(UN0X)), chosen according to 
the criteria described above, is obviously an atlas for the boundary 0X, 
and we contend that it is an oriented atlas. 


We prove this easily as follows. If 


(X1,---,X,) =X and (Vi5-++Vn) =V 


are coordinate systems at a boundary point corresponding to choices of 
charts made according to our specifications, then we can write y = f(x) 
where f =(/,,...,f,) is the transition mapping. Since we deal with 
oriented charts for X, we know that A,(x)>0 for all x. Since f maps 
boundary into boundary, we have 


fi (0, X25 +++5Xq) =0 


for all x,, ...,x,. Consequently the Jacobian matrix of f at a point 
(0, x,...,X,) 1s equal to 


D, f,(0, X2,.--X,) Ove 0 
a 
* Aen?) ’ 


* 
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where A"~)) is the Jacobian matrix of the transition map g induced by f 
on the boundary, and given by 


y2 = f,(0, X25 wens Xp h 


Yn = Sn(O, X25 +++ s%n)- 


However, we have 


eee 
D, f,(0, X2; 12+ 5X,) —! lim — 
h 


70 


taking the limit with h <0 since by prescription, points of X have coor- 
dinates with x, <0. Furthermore, for the same reason we have 


Fil, X25 +++5Xq_) < 0. 
Consequently 
D, f,(0, x2, ...5X,) > 0. 


From this it follows that A’"(x,,...,x,) > 0, thus proving our assertion 
that the atlas we have defined for 0X is oriented. 


From now on, when we deal with an oriented manifold, it is understood 
that its boundary is taken with orientation described above, and called the 
induced orientation. 


XI, §4. THE MEASURE ASSOCIATED WITH 
A DIFFERENTIAL FORM 


Let X be a manifold of class C? with p21. We assume from now on 
that X is Hausdorff and has a countable base. Then we know that X 
admits C? partitions of unity, subordinated to any given open covering. 


(Actually, instead of the conditions we assumed, we could just as well 
have assumed the existence of C? partitions of unity, which is the precise 
condition to be used in the sequel.) 

We can define the support of a differential form as we defined the 
support of a function. It is the closure of the set of all xe X such that 
w(x) #0. If w is a form of class C? and « is a C4 function on X, then 
we can form the product aw, which is the form whose value at x is 
a(x)c(x). If « has compact support, then «@ has compact support. Later, 
we shall study the integration of forms, and reduce this to a local problem 
by means of partitions of unity, in which we multiply a form by functions. 
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We assume that the reader is familiar with the correspondence be- 
tween certain functionals on continuous functions with compact support 
and measures. Cf. [La 93] for this. We just recall some terminology. 

We denote by C,(X) the vector space of continuous functions on X 
with compact support (ic. vanishing outside a compact set). We write 
CX, R) or C,(X, C) if we wish to distinguish between the real or com- 
plex valued functions. 

We denote by C,(X) the subspace of C,(X) consisting of those func- 
tions which vanish outside K. (Same notation C,(X) for those functions 
which are 0 outside any subset S of X. Most of the time, the useful 
subsets in this context are the compact subsets K.) 

A linear map 4 of C,(X) into the complex numbers (or into a normed 
vector space, for that matter) is said to be bounded if there exists some 
C = 0 such that we have 


lAf| = CIlf|l 


for all f ¢ C,(X). Thus Ais bounded if and only if 4 is continuous for the 
norm topology. 

A linear map 4 of C,(X) into the complex numbers is said to be 
positive if we have Af = 0 whenever f is real and 2 0. 


Lemma 4.1. Let 1:C,(X)—C be a positive linear map. Then 1 is 
bounded on Cx(X) for any compact K. 


Proof. By the corollary of Urysohn’s lemma, there exists a continuous 
real function g 20 on X which is 1 on K has compact support. If 
feC,(X), let b = || f ||. Say f is real. Then bg + f 2 0, whence 


A(bg) + Af 20 
and |Af| < bA(g). Thus dg is our desired bound. 


A complex valued linear map on C,(X) which is bounded on each 
subspace C,(X) for every compact K will be called a C,-functional on 
C,(X), or more simply, a functional. A functional on C,(X) which is also 
continuous for the sup norm will be called a bounded functional. It is 
clear that a bounded functional is also a C,-functional. 


Lemma 4.2. Let {W,} be an open covering of X. For each index a, let 
A, be a functional on C,(W,). Assume that for each pair of indices a, B 
the functionals 2, and A, are equal on C(W,0 Wy). Then there exists a 
unique functional 4 on X whose restriction to each C,(W,) is equal to A,. 
If each /, is positive, then so is A. 
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Proof. Let f€C,(X) and let K be the support of f. Let {h;} be a 
partition of unity over K subordinated to a covering of K by a finite 
number of the open sets W,. Then each h;f has support in some W,, 
and we define 


Af = Dy Agi hif ). 


We contend that this sum is independent of the choice of «(i), and also 
of the choice of partition of unity. Once this is proved, it is then obvious 
that 2 is a functional which satisfies our requirements. We now prove 
this independence. First note that if W,,, is another one of the open sets 
W, in which the support of h;f is contained, then h;f has support in the 
intersection Wy; W,,;, and our assumption concerning our functionals 
A, Shows that the corresponding term in the sum does not depend on the 
choice of index a(i). Next, let {g,} be another partition of unity over K 
subordinated to some covering of K by a finite number of the open sets 
W,. Then for each i, 


hf = Z IS, 


whence 


py Aynlhif) = > x Au (Ghif) 


If the support of g,h,f is in some W,, then the value 1,(g,h;f) is indepen- 
dent of the choice of index « The expression on the right is then 
symmetric with respect to our two partitions of unity, whence our theo- 
rem follows. 


Theorem 4.3. Let dim X =n and let w be an n-form on X of class C°, 
that is continuous. Then there exists a unique positive functional A on 
C,(X) having the following property. If (U, ) is a chart and 


(x) = f(x) dx, A °+: A dx, 


is the local representation of w in this chart, then for any g € C,(X) with 
support in U, we have 


(1) 4g = | Go(x)| f(x) dx, 


where g, represents g in the chart [i.e. g,(x) = g(y *(x))], and dx is 
Lebesgue measure. 


Proof. The integral in (1) defines a positive functional on C,(U). The 
change of variables formula shows that if (U,@) and (V,w) are two 
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charts, and if g has support in UV, then the value of the functional is 
independent of the choice of charts. Thus we get a positive functional by 
the general localization lemma for functionals. 


The positive measure corresponding to the functional in Theorem 4.3 
will be called the measure associated with |w|, and can be denoted by 
Mol: 

Theorem 4.3 does not need any orientability assumption. With such 
an assumption, we have a similar theorem, obtained without taking the 
absolute value. 


Theorem 4.4. Let dim X =n and assume that X is oriented. Let w be 
an n-form on X of class C®. Then there exists a unique functional A on 
C.(X) having the following property. If (U, @) is an oriented chart and 


(x) = f(x) dx, A-+*: A dx, 


is the local representation of w in this chart, then for any g € C,(X) with 
support in U, we have 


Ag = | ‘ Jo(x) f(x) ax, 


where g, represents g in the chart, and dx is Lebesgue measure. 


Proof. Since the Jacobian determinant of transition maps belonging to 
oriented charts is positive, we see that Theorem 4.4 follows like Theorem 
4.3 from the change of variables formula (in which the absolute value 
sign now becomes unnecessary) and the existence of partitions of unity. 


If 4 is the functional of Theorem 4.4, we shall call it the functional 
associated with w. For any function g € C,(X), we define 


| go = dg. 
x 


If in particular @ has compact support, we can also proceed directly as 
follows. Let {a;} be a partition of unity over X such that each a, has 
compact support. We define 


fo-% jae 


all but a finite number of terms in this sum being equal to 0. As usual, 
it is immediately verified that this sum is in fact independent of the 
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choice of partition of unity, and in fact, we could just as well use only a 
partition of unity over the support of w. Alternatively, if « is a function 
in C,(X) which is equal to 1 on the support of @, then we could also 


define 
| (0) -| aa. 
xX Xx 


It is clear that these two possible definitions are equivalent. In particu- 
lar, we obtain the following variation of Theorem 4.4. 


Theorem 4.5. Let X be an oriented manifold of dimension n. Let 
"(X) be the R-space of differential forms with compact support. There 
exists a unique linear map 


on| 7) of B@i(xX)>R 
x 


such that, if w has support in an oriented chart U with coordinates x,, 
see5X, and w(x) = f(x) dx, A ++: A dx, in this chart, then 


| o= | f(x) dx, ++: dx,. 


Let X be an oriented manifold. By a volume form Q we mean a form 
such that in every oriented chart, the form can be written as 


Q(x) = f(x) dxy A+: A dx, 


with f(x) >0 for all x. In the next section, we shall see how to get a 
volume form from a Riemannian metric. Here, we shall consider the 
non-oriented case to get the notion of density. 

Even when a manifold is not orientable, one may often reduce certain 
questions to the orientable case, because of the following result. We 
assume that readers are acquainted with basic facts about coverings. 


Proposition 4.6. Let X be a connected C' manifold. If X is not ori- 
entable, then there exists a covering X'— X of degree 2 such that X' is 
orientable. 


Sketch of Proof. Suppose first that X is simply connected. Let xe X. 
Fix a chart (Up, @o) at x such that the image of the chart is an open ball 
in euclidean space. Let y be any point of X, and let «a: [a,b] X bea 
piecewise C! path from x to y. We select a sufficiently fine partition 


[a, bj a [to, ti, oe vital, 
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and open sets U,; containing «([t;, t;,,]), such that U, has an isomorphism 
g; onto an open ball in euclidean space, and such that the charts , and 
@;4, have the same orientation. It is easy to verify that if two paths are 
homotopic, then the charts which we obtain at y by “continuation” as 
above along the two paths are orientation equivalent. This is done first 
for paths which are “close together,” and then extended to homotopic 
paths, according to the standard technique which already appears in 
analytic continuation. Thus fixing one orientation in the neighborhood 
of a give point determines an orientation on all of X when X is simply 
connected. 

Now suppose X not simply connected, and let X be its universal 
covering space. Let I be the fundamental group. Then the subgroup of 
elements y ¢ I which preserve an orientation of X is of index 2, and the 
covering corresponding to this subgroups has degree 2 over X and can 
be given an orientation by using charts which lift to oriented charts in 
the universal covering space. This concludes the proof. 


Densities 


The rest of this section will not be used, especially not for Stokes’ theo- 
rem in the next chapter. However, Theorem 4.3 for the non-orientable 
case is important for other applications, and we make further comments 
about this other context. 

Let s be a real number. Let E be a finite dimensional vector space 
over R, of dimension n. We denote by E* the set of non-zero elements of 
E, and by /\" E* the set of non-zero elements of a E. By an s-density 
on E we mean a function 


6: N' E* +R such that d(cw) = |c|°d(w) 


for all c#0 in R and we (" E*. Equivalently, we could say that there 
exists an n-form w € L?(E, R) such that for v; e E we have 


O(v, Avs A Uy) = |@(v,,---50,)/* 


We let den‘(E) denote the set of densities of E. An element of den‘(E) 
amounts to picking a basis of A"E, up to a factor +1, and assigning a 
number to this basis. 

Let U be an open subset of E. By a C? density on U we mean a C? 
morphism 6: U >den‘(E). Note that den‘(E) is an open half line, so a 
density on U amounts to selecting a differential form of class C? on a 
neighborhood of each point of U, such that the absolute values of these 
forms coincide on intersections of these neighborhoods. 
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Let f: U—-V be a C? isomorphism. Then f induces a map on 
densities, by the change of variable formula on forms with the Jacobian 
determinant, and then taking absolute values to the s-power. Thus we 
may form the bundle (not vector bundle) of densities, with charts 


U x den‘(E) 


over U, and density-bundle morphisms just as we did with differential 
forms. For example, let E = R", with coordinates x,, ...,x,. Then 


|dx, A+t: A dx,| = dx,°++ dx, 


defines a 1-density, and |dx, A --- A dx,|*° defines an s-density, denoted by 
|dx|*. 

Observe that s-densities form a cone, ie. if 6,, 6, are s-densities on a 
manifold X, and a,, a,¢R* (the set of positive real numbers), then 
a,6, + a,6, is also an s-density. In particular, continuing to assume that 
X admits continuous partitions of unity, we can reformulate and prove 
Theorem 4.3 for densities. Indeed, the differential form in Theorem 4.3 
need not be globally defined, because one needs only its absolute value 
to define the integral. Thus with the language of densities, Theorem 4.3 
reads as follows. 


Theorem 4.7. Let 5 be a C° density on X, i.e. a continuous density. 
Then there is a positive functional 4 on C,(X) having the following 
property. If U is a chart and 6 is represented by the density 
| f(x) dx, A+++ A dx,| on this chart, then for any function @ € C(U) we 
have 


Mo) = : Pulx)|F(x)| dx1 +++ dx,, 


where dx,:::dx, is the usual symbol for ordinary integration on R", and 
@y is the representation of @ in the chart. 


Examples. We have already given the example of integration with 
respect to |dx, A °:: A dx,| in euclidean space. Here is a less trivial 
example. Let X be a Riemannian manifold of finite dimension n, with 
Riemannian metric g. Locally in a chart U, we view g as a morphism 


g: ULE, E), 


with E having a fixed positive definite scalar product. With respect to an 
orthonormal basis, we have a linear metric isomorphism E = R", and g(x) 
at each point x can be represented by a matrix (gi(x)). If we put 


6(x) = |det gj(x)| dx, --- dx, 
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then 6 defines a density, called the Riemannian density; and 
6*?(x) = |det gij()|*” dx, ++ dx, 
defines the Riemannian half density. 


Remark. Locally, a manifold is always orientable. Hence a formula or 
result which is local, and is proved in the orientable case, also applies to 
densities, sometimes by inserting an absolute value sign. For example, 
Proposition 1.2 of Chapter X applies after inserting absolute value signs, 
but Proposition 2.1 of Chapter X applies as stated for the Riemannian 
density instead of the Riemannian volume. 


CHAPTER XIl 


Stokes’ Theorem 


Throughout the chapter, all manifolds are assumed finite 
dimensional. They may have a boundary. ° 


XIl, §1. STOKES’ THEOREM FOR A 
RECTANGULAR SIMPLEX 


If X is a manifold and Y a submanifold, then any differential form on X 
induces a form on Y. We can view this as a very special case of the 
inverse image of a form, under the embedding (injection) map 


id: Yo X. 


In particular, if Y has dimension n — 1, and if (x,,...,x,) is a system of 
coordinates for X at some point of Y such that the points of Y corre- 
spond to those coordinates satisfying x; = c for some fixed number c, and 
index j, and if the form on X is given in terms of these coordinates by 


(x) = f(X1,.--5X_,) dX, Avt7 A dX,, 


then the restriction of w to Y (or the form induced on Y) has the repre- 
sentation 


WN 
F(X 45662 9€y 00 sXp_) AX, Aves A dX; Awt* A dXy. 
We should denote this induced form by ay, although occasionally we 


omit the subscript Y. We shall use such an induced form especially when 
Y is the boundary of a manifold X. 
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Let 
R= [a,, by] xT xX [a,, | 


be a rectangle in n-space, that is a product of n closed intervals. The set 
theoretic boundary of R consists of the union over all i= 1, ...,n of the 
pieces 

R? = [a,, bi] x ++: x {a;} x ++ x {a,, By}, 


R} = [a,, by] x +++ x {bj} x + x Lay, By]. 
If ue 
(X15 6+ 5Xq) = f(% 4, +. 5Xq) dH Avs A ax; Aot+ A dX, 


is an (n — 1)-form, and the roof over anything means that this thing is to 
be omitted, then we define 


“N 
by bi by "IN 
o= ee et IL OgyscsOig seek, Oop ag aes 
RO a; a; ay 


if i=j, and 0 otherwise. And similarly for the integral over R}. We 
define the integral over the oriented boundary to be 


Jaa Me lah 


Stokes’ Theorem for Rectangles. Let R be a rectangle in an open set U 
in n-space. Let w be an (n — 1)-form on U. Then 


| dw -( o. 
R aR 


Proof. In two dimensions, the picture looks like this: 


It suffices to prove the assertion when w is a decomposable form, say 


(x) = f(x,,.-.5.X,) dx, Act n dx, A “++ A dX,. 
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We then evaluate the integral over the boundary of R. If i #j, then it is 
clear that 


so that 


| a 
OR 
aN 
maz by bn n~ 
eal | -(" LA (15 02 js 0-0 Xn) — S015 6 Djs eX pq) dx + dx; 7 dX,. 


On the other hand, from the definitions we find that 


el 
* ax 


of “~ 
da(x) = (2 dx,+° =i ellie 


ef ye 


dx, A**: A dx,. 


(The (—1)’~* comes from interchanging dx, with dx,, ...,dx,-,. All other 
terms disappear by the alternating rule.) 

Integrating dw over R, we may use repeated integration and integrate 
of/dx; with respect to x, first. Then the fundamental theorem of calculus 
for one variable yields 


i x AX; = f (X15. + Djy 06 05% pq) — LO 1, «Ay oe Xp) 


We then integrate with respect to the other variables, and multiply by 
(—1)"!. This yields precisely the value found for the integral of w over 
the oriented boundary 6°R, and proves the theorem. 


Remark. Stokes’ theorem for a rectangle extends at once to a version 
in which we parametrize a subset of some space by a rectangle. Indeed, 
if ¢: RV is a C' map of a rectangle of dimension n into an open set V 
in R¥, and if w is an (n — 1)-form in V, we may define 


| do = | o* do. 

o R 

| o= | o*a, 
da OOR 


One can define 
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and then we have a formula 


In the next section, we prove a somewhat less formal result. 


XIl, §2. STOKES’ THEOREM ON A MANIFOLD 


Theorem 2.1. Let X be an oriented manifold of class C?, dimension n, 
and let w be an (n—1)-form on X, of class C'. Assume that w has 


compact support. Then 
| dw = | o. 
X ox 


Proof. Let {a;};-, be a partition of unity, of class C?. Then 
YY 4,0 = a, 


iel 


and this sum has only a finite number of non-zero terms since the 
support of @ is compact. Using the additivity of the operation d, and 
that of the integral, we find 


I. dw = D i, d(a;c). 


Suppose that «; has compact support in some open set V; of X and that 


we can prove 
| d(a;@) = | a;, 
V; Vinox 


in other words we can prove Stokes’ theorem locally in V;. We can write 


| F160) => | a, 
VizN6x 6x 
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| ao) = | d(a;q). 
V; xX 


Using the additivity of the integral once more, we get 


and similarly 


| dow=) | dao)= > a0 = | , 
x xX iel J 0x 


iel Ox 


which yields Stokes’ theorem on the whole manifold. Thus our argument 
with partitions of unity reduces Stokes’ theorem to the local case, namely 
it suffices to prove that for each point of X these exists an open neigh- 
borhood V such that if @ has compact support in V, then Stokes’ theo- 
rem holds with X replaced by V. We now do this. 

If the point is not a boundary point, we take an oriented chart (U, ¢) 
at the point, containing an open neighborhood V of the point, satisfying 
the following conditions: @U is an open ball, and @V is the interior of a 
rectangle, whose closure is contained in gU. If w has compact support 
in V, then its local representation in @U has compact support in pV. 
Applying Stokes’ theorem for rectangles as proved in the preceding sec- 
tion, we find that the two integrals occurring in Stokes’ formula are 
equal to 0 in this case (the integral over an empty boundary being equal 
to 0 by convention). 

Now suppose that we deal with a boundary point. We take an 
oriented chart (U, ¢) at the point, having the following properties. First, 
gU is described by the following inequalities in terms of local coordi- 
nates (X,,...,X,): 


—2<x,<1 and —2<x,;<2 for j=2,...,n. 


Next, the given point has coordinates (1,0,...,0), and that part of U on 
the boundary of X, namely Un ¢X, is given in terms of these coordi- 
nates by the equation x;=1. We then let V consist of those points 
whose local coordinates satisfy 


O0<x,<1 and —1<x,<1 for j=2,...,n. 


If @ has compact support in V, then @ is equal to 0 on the boundary of 
the rectangle R equal to the closure of pV, except on the face given by 
x, = 1, which defines that part of the rectangle corresponding to dX 0 V. 
Thus the support of w looks like the shaded portion of the following 
picture. 
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In the sum giving the integral over the boundary of a rectangle as in the 
previous section, only one term will give a non-zero contribution, corre- 
sponding to i = 1, which is 


ba ae) a 


Furthermore, the integral over R? will also be 0, and in the contribution 
of the integral over R!, the two minus signs will cancel, and yield the 
integral of w over the part of the boundary lying in V, because our 
charts are so chosen that (x,,...,X,,) 1s an oriented system of coordinates 
for the boundary. Thus we find 


| do = | Q, 
4 Vex 


which proves Stokes’ theorem locally in this case, and concludes the 
proof of Theorem 2.7. 


Corollary 2.2. Suppose X is an oriented manifold without boundary, and 
@ has compact support. Then 


| dw = 0. 
xX 


For any number of reasons, some of which we consider in the next 
section, it is useful to formulate conditions under which Stokes’ theorem 
holds even when the form does not have compact support. We shall 
say that w has almost compact support if there exists a decreasing se- 
quence of open sets {U,} in X such that the intersection 


(\U 
k=1 
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is empty, and a sequence of C' functions {g,}, having the following 
properties: 


AC 1. We have 0S 9, <1, g, =1 outside U,, and g,@ has compact 
support. 
AC 2. If us, is the measure associated with |dg, A w| on X, then 


lim u,(U;,) = 0. 
koa 
We then have the following application of Stokes’ theorem. 


Corollary 2.3. Let X be a C? oriented manifold, of dimension n, and let 
ow be an (n—1)-form on X, of class C'. Assume that w has almost 
compact support, and that the measures associated with |dw| on X and 


|w| on 0X are finite. Then 
| da = | Qo. 
x ax 


Proof. By our standard form of Stokes’ theorem we have 


| no=| itao)= | in. 0+ | 9, do. 
ax x x x 


We estimate the left-hand side by 


| @ -| G,O 
ax x 


Since the intersection of the sets U, is empty, it follows for a purely 
measure-theoretic reason that 


im | n0-| o. 
k>o J ax ox 


lim | 9g, dw = | do. 
ka JX X 


The integral of dg, A @ over X approaches 0 as k > by assumption, 
and the fact that dg, A @ is equal to 0 on the complement of U, since g, 
is constant on this complement. This proves our corollary. 


= I, (1 — sa SZ My (U, 0 OX). 
».¢ 


Similarly, 


The above proof shows that the second condition AC 2 is a very 
natural one to reduce the integral of an arbitrary form to that of a form 
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with compact support. In the next section, we relate this condition to a 
question of singularities when the manifold is embedded in some bigger 
space. 


XIl, §3. STOKES’ THEOREM WITH SINGULARITIES 


If X is a compact manifold, then of course every differential form on X 
has compact support. However, the version of Stokes’ theorem which we 
have given is useful in contexts when we start with an object which is 
not a manifold, say as a subset of R", but is such that when we remove a 
portion of it, what remains is a manifold. For instance, consider a cone 
(say the solid cone) as illustrated in the next picture. 


The vertex and the circle surrounding the base disc prevent the cone 
from being a submanifold of R°. However, if we delete the vertex and 
this circle, what remains is a submanifold with boundary embedded in 
R°. The boundary conists of the conical shell, and of the base disc 
(without its surrounding circle). Another example is given by polyhedra, 
as on the following figure. 


The idea is to approximate a given form by a form with compact 
support, to which we can apply Theorem 2.1, and then take the limit. 
We shall indicate one possible technique to do this. 

The word “boundary” has been used in two senses: The sense of 
point set topology, and the sense of boundary of a manifold. Up to now, 
they were used in different contexts so no confusion could arise. We 
must now make a distinction, and therefore use the word boundary only 
in its manifold sense. If X is a subset of R%, we denote its closure by X 


[XII, §3] STOKES’ THEOREM WITH SINGULARITIES 315 


as usual. We call the set-theoretic difference X — X the frontier of X in 
R®, and denote it by fr(X). 

Let X be a submanifold without boundary of R%, of dimension n. We 
know that this means that at each point of X there exists a chart for an 
open neighborhood of this point in R™ such that the points of X in this 
chart correspond to a factor in a product. A point P of X — X will be 
called a regular frontier point of X if there exists a chart at P in R™ with 
local coordinates (x,,...,Xy) such that P has coordinates (0,...,0); the 
points of X are those with coordinates 


Xnty = = Xy = 0 and xX, <9; 


and the points of the frontier of X which lie in the chart are those with 
coordinates satisfying 


Xn = Xngy = = Xy =O. 


The set of all regular frontier points of X will be denoted by 0X, and 
will be called the boundary of X. We may say that X UdX is a sub- 
manifold of R%, possibly with boundary. 

A point of the frontier of X which is not regular will be called 
singular. It is clear that the set of singular points is closed in R™. We 
now formulate a version of Theorem 2.1 when @ does not necessarily 
have compact support in X UdX. Let S be a subset of R%. By a 
fundamental sequence of open neighborhoods of S we shall mean a se- 
quence {U,} of open sets containing S such that, if W is an open set 
containing S, then U, < W for all sufficiently large k. 

Let S be the set of singular frontier points of X and let w be a form 
defined on an open neighborhood of X, and having compact support. 
The intersection of supp w with (X UdX) need not be compact, so that 
we cannot apply Theorem 2.1 as it stands. The idea is to find a funda- 
mental sequence of neighborhods {U,} of S, and a function g, which is 0 
on a neighborhood of S and 1 outside U, so that g,@ differs from @ 
only inside U,. We can then apply Theorem 2.1 to g,@ and we hope 
that taking the limit yields Stokes’ theorem for w itself. However, we 
have 


| ita.0)= | in. 50+ | 9, do. 
x x x 


Thus we have an extra term on the right, which should go to 0 as k > co 
if we wish to apply this method. In view of this, we make the following 
definition. 

Let S be a closed subset of RX. We shall say that S is negligible for X 
if there exists an open neighborhood U of S in R¥, a fundamental 
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sequence of open neighborhoods {U,} of S in U, with U,< U, and a 
sequence of C' functions {g,}, having the following properties. 


NEG 1. We have 0S g, <1. Also, g,(x) =0 for x in some open neigh- 
borhood of S, and g,(x) = 1 for x ¢ U,. 

NEG 2. If @ is an (n—1)-form of class C1 on U, and 1, is the 
measure associted with |dg, \ w| on UX, then uy, is finite 
for large k, and 


lim u(U 0X) = 
ko 


From our first condition, we see that g,@ vanishes on an open neigh- 
borhood of S. Since g, = 1 on the complement of U,, we have dg, = 0 
on this complement, and therefore our second condition implies that the 
measures induced on X near the singular frontier by |dg, A | (for k = 1, 
2,...), are concentrated on shrinking neighborhoods and tend to 0 as 
k> oo. 


Theorem 3.1 (Stokes’ Theorem with Singularities). Let X be an 
oriented, C? submanifold without boundary of RN. Let dim X =n. Let 
@ be an (n — 1)-form ofclass C*' on an open neighborhood of X in RN, 
and with compact support. Assume that: 


(i) If S is the set of singular points in the frontier of X, then Sc 
supp o is negligible for X. 
(ii) The measures associated with |dw| on X, and |w| on 6X, are finite. 


Then 
| dw = | @ 
XxX Ox 


Proof. Let U, {U,}, and {g,} satisfy conditions NEG 1 and NEG 2. 
Then g,@ is 0 on an open neighborhood of S, and since w is assumed to 
have compact support, one verifies immediately that 


(supp g,@) A(X U dX) 


is compact. Thus Theorem 2.1 is applicable, and we get 


J 9.0 = d(g,@) = I, dg, \ o+| 9, dw. 
s|{ (t= ano 


= | 1 d pj = Mo (Uj, 0 OX). 
U,.NOX 


We have 


Gee a 
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Since the intersection of all sets U,0X is empty, it follows from purely 
measure-theoretic reasons that the limit of the right-hand side is 0 as 


k— oo. Thus 
lim | g,.0 = | o. 
k>0 Jax ox 


For similar reasons, we have 


im | onde = | do. 
k>o JX 4 


Our second assumption NEG 2 guarantees that the integral of dg, A w 
over X approaches 0. This proves our theorem. 


Criterion 1. Let S, T be compact negligible sets for a submanifold X of 
R™ (assuming X without boundary). Then the union Su T is negligible 
for X. 


Proof. Let U, {U,}, {g,} and V, {\}, {h,} be triples associated with S 
and T respectively as in condition NEG 1 and NEG 2 (with V replacing 
U and h replacing g when T replaces S). Let 


W=UUVY W=G,0h, and  fp=9hy- 


Then the open sets {W,} form a fundamental sequence of open neighbor- 
hoods of SUT in W, and NEG 1 is trivially satisfied. As for NEG 2, we 
have 


d(g,) Ao =h,dg, Ao + 9, dh, A o, 
so that NEG 2 is also trivially satisfied, thus proving our criterion. 


Criterion 2. Let X be an open set, and let S be a compact subset in R". 
Assume that there exists a closed rectangle R of dimension m<n—2 
and a C' map o: RR" such that S =o(R). Then S is negligible for 
xX. 


Before giving the proof, we make a couple of simple remarks. First, 
we could always take m =n — 2, since any parametrization by a rectan- 
gle of dimension < n—2 can be extended to a parametrization by a 
rectangle of dimension n—2 simply by projecting away coordinates. 
Second, by our first criterion, we see that a finite union of sets as 
described above, that is parametrized smoothly by rectangles of codimen- 
sion = 2, are negligible. Third, our Criterion 2, combined with the first 
criterion, shows that negligibility in this case is local, that is we can 
subdivide a rectangle into small pieces. 
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We now prove Criterion 2. Composing o with a suitable linear map, 
we may assume that R is a unit cube. We cut up each side of the cube 
into k equal segments and thus get k™ small cubes. Since the derivative 
of o is bounded on a compact set, the image of each small cube is 
contained in an n-cube in R™ of radius < C/k (by the mean value 
theorem), whose n-dimensional volume is < (2C)"/k". Thus we can cover 
the image by small cubes such that the sum of their n-dimensional vol- 
umes is 

S (20)"/k"™ S (20)"/k?. 


Lemma 3.2. Let S be a compact subset of R". Let U, be the open set 
of points x such that d(x, S)<2/k. There exists a C®” function g, on 
R® which is equal to 0 in some open neighborhood of S, equal to 1 
outside U,, OS g, <1, and such that all partial derivatives of g, are 
bounded by C,k, where C, is a constant depending only on n. 


Proof. Let @ be a C® function such that 0 < » < 1, and 


o(x)=0 if OS |x <3, 
g(x)=1 if 1S |IxI. 


We use || || for the sup norm in R". The graph of ¢ looks like this: 


For each positive integer k, let p(x) = p(kx). Then each partial deriva- 
tive D,@, satisfies the bound 


|D;9| < k\|D;el, 


which is thus bounded by a constant times k. Let L denote the lattice of 
integral points in R”. For each | € L, we consider the function 


I 
xr o(x - 5): 


This function has the same shape as ¢, but is translated to the point 
1/2k. Consider the product 


9(X) = Il Pr (x = x) 
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taken over all Je L such that d(l/2k, S) < 1/k. If x is a point of R” such 
that d(x, S) < 1/4k, then we pick an | such that 


d(x, I/2k) < 1/2k. 


For this 1 we have d(I/2, S) < 1/k, so that this 1 occurs in the product, 
and 
@,(x — 1/2k) = 0. 


Therefore g, is equal to 0 in an open neighborhood of S. If, on the other 
hand, we have d(x, S) > 2/k and if ! occurs in the product, that is 


d(1/2k, S) S 1/k, 
then 
d(x, 1/2k) > 1/k, 


and hence g,(x)=1. The partial derivatives of g, are bounded in the 
desired manner. This is easily seen, for if xo is a point where g, is not 
identically 1 in a neighborhood of x9, then ||x9 — [p/2k|| S 1/k for some 
I. All other factors g(x — 1/2k) will be identically 1 near x, unless 
xo — 1/2k|| S 1/k. But then ||! — Ip|| $4 whence the number of such | is 
bounded as a function of n (in fact by 9"). Thus when we take the 
derivative, we get a sum of at most 9” terms, each one having a deriva- 
tive bounded by C,k for some constant C,. This proves our lemma. 


We return to the proof of Criterion 2. We observe that when an 
(n — 1)-form @ is expressed n terms of its coordinates, 


“aN 
a(x) = ¥ f(x) dxy A+++ A dx; A+++ A AX,, 


then the coefficients f; are bounded on a compact neighborhood of S. 
We take U, as in the lemma. Then for k large, each function 


xr f(x) Digu() 


is bounded on U, by a bound C,k, where C, depends on a bound for a, 
and on the constant of the lemma. The Lebesgue measure of U, is 
bounded by C;/k?, as we saw previously. Hence the measure of U, 
associated with |dg, A w| is bounded by C,/k, and tends to 0 as k > oo. 
This proves our criterion. 


As an example, we now state a simpler version of Stokes’ theorem, 
applying our criteria. 


Theorem 3.3. Let X be an open subset of R". Let S be the set of 
singular points in the closure of X, and assume that S is the finite union 
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of C* images of m-rectangles with m < -n—2. Let w be an (n — 1)-form 
defined on an open neighborhood of X. Assume that w has compact 
support, and that the measure associated with |w| on 0X and with |da| 


on X are finite. Then 
| do = | @. 
x ax 


Proof. Immediate from our two criteria and Theorem 2. 


We can apply Theorem 3.3 when, for instance, X is the interior of a 
polyhedron, whose interior is open in R". When we deal with a submani- 
fold X of dimension n, embedded in a higher dimensional space R¥, then 
one can reduce the analysis of the singular set to Criterion 2 provided 
that there exists a finite number of charts for X near this singular set on 
which the given form w is bounded. This would for instance be the case 
with the surface of our cone mentioned at the beginning of the section. 
Criterion 2 is also the natural one when dealing with manifolds defined 
by algebraic inequalities. By using Hironaka’s resolution of singularities, 
one can parametrize a compact set of algebraic singularities as in Crite- 
rion 2. 

Finally, we note that the condition that @ have compact support in an 
open neighborhood of X is a very mild condition. If for instance X is a 
bounded open subset of R", then X is compact. If @ is any form on 
some open set containing X, then we can find another form ny which is 
equal to w on some open neighborhood of X and which has compact 
support. The integrals of 4 entering into Stokes’ formula will be the 
same as those of w. To find y, we simply multiply @ with a suitable C” 
function which is 1 in a neighborhood of X and vanishes a little further 
away. Thus Theorem 3.3 provides a reasonably useful version of Stokes’ 
theorem which can be applied easily to all the cases likely to arise 
naturally. 


CHAPTER  XiIll 


Applications of Stokes’ 
Theorem 


In this chapter we give a survey of applications of Stokes’ theorem, 
concerning many situations. Some come just from the differential theory, 
such as the computation of the maximal de Rham cohomology (the space 
of all forms of maximal degree modulo the subspace of exact forms); 
some come from Riemannian geometry; and some come from complex 
manifolds, as in Cauchy’s theorem and the Poincaré residue theorem. I 
hope that the selection of topics will give readers an outlook conducive 
for further expansion of perspectives. The sections of this chapter are 
logically independent of each other, so the reader can pick and choose 
according to taste or need. 


XIll, §1. THE MAXIMAL DE RHAM COHOMOLOGY 


Let X be a manifold of dimension n without boundary. Let r be an 
integer = 0. We let W(X) be the R-vector space of differential forms on 
X of degree r. Thus o/'(X)=0 if r>n. If we."(X), we define the 
support of @ to be the closure of the set of points xe X such that 
a(x) #0. 


Examples. If w(x) = f(x) dx, A+: A dx, on some open subset of R’, 
then the support of w is the closure of the set of x such that f(x) # 0. 


We denote the support of a form w by supp(m). By definition, the 
support is closed in X. We are interested in the space of maximal degree 
forms "(X). Every form @ € "(X) is such that do = 0. On the other 
hand, »"(X) contains the subspace of exact forms, which are defined to 
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be those forms equal to dy for some ne." 1(X). The factor space is 
defined to be the de Rham cohomology H"(X) = H"(X,R). The main 
theorem of this section can then be formulated. 


Theorem 1.1. Assume that X is compact, orientable, and connected. 


Then the map 
o| 7) 
x 


induces an isomorphism of H"(X) with R itself. In particular, if @ is in 
s$"(X) then there exists 4 €.""1(X) such that dn = if and only if 


[o=o 
x 


Actually the hypothesis of compactness on X is not needed. What is 
needed is compactness on the support of the differential forms. Thus we 
are led to define !(X) to be the vector space of n-forms with compact 
support. We call a form compactly exact if it is equal to dy for some 
ne SS! 4(X). We let 


H"(X) = factor space f?(X)/d.w""!(X). 
Then we have the more general version: 


Theorem 1.2. Let X be a manifold without boundary, of dimension n. 
Suppose that X is orientable and connected. Then the map 


on | @ 
xX 


induces an isomorphism of H?(X) with R itself. 


Proof. By Stokes’ theorem (Chapter XII, Corollary 2.2) the integral 
vanishes on exact forms (with compact support), and hence induces an 
R-linear map of H?(X) into R. The theorem amounts to proving the 
converse statement: if 

| wo =0, 
Xx 


then there exists some n € ."!(X) such that w = dy. For this, we first 
have to prove the result locally in R", which we now do. 
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As a matter of notation, we let 
I" = (0, 1)" 
be the open n-cube in R”. What we want is: 


Lemma 1.3. Let w be an n-form on I", with compact support, and such 


that 
| w= 0. 
yg. 


Then there exists a form n € x" 1(I""1) with compact support, such that 

@ = dn. 
We will prove Lemma 1.3 by induction, but it is necessary to load the 
induction to carry it out. So we need to prove a stronger version of 


Lemma 1.3 as follows. 


Lemma 1.4. Let w be an (n—1)-form on I" whose coefficient is a 
function of n variables (x1, ...,X,) SO 


(x) = f(x1,...5%,) dX, Acts A dxXq_4. 


(Of course, all functions, like forms, are assumed C®.) Suppose that w 
has compact support in I""1. Assume that 


| o=0. 
qni 


Then there exists an (n — 2)-form n, whose coefficients are C® functions 
Of X14, -++5X, Such that 


w(x; peeeoXn—5 Xn) = d,-1n(X1 seeeoXy—15 Xp). 


The symbol d,_, here means the usual exterior derivative taken with 
respect to the first n — 1 variables. 


Proof. By induction. We first prove the theorem when n—1=1. 
First we carry out the proof leaving out the extra variable, just to see 
what’s going on. So let 


(x) = f(x) dx, 


where f has compact support in the open interval (0,1). This means 
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there exists ¢ >0 such that f(x)=0 if O<x Se and if 1—ex<x<l. 
We assume 


f. I(x) dx = 0. 
Let 


glx) = | : fit) dt. 


Then g(x) = 0 if 0 < x <e, and also if 1 -e < x < 1, because for instance 
ifil—e<x <1, then 


g(x) = i f(t) dt = 0. 


Then f(x) dx = dg(x), and the lemma is proved in this case. Note that 
we could have carried out the proof with the extra variable x,, starting 
from 
(x) = f(%1, x2) dx, 
so that 
1 


G(X, X2) = | S(t, X) dt. 


0 


We can differentiate under the integral sign to verify that g is C® in the 
pair of variables (x,, x2). 

Now let n = 3 and assume the theorem proved for n — 1 by induction. 
To simplify the notation, let us omit the extra variable x,,,, and write 


(x) = f(X1,---5X_) dX, Avt1 A dXy, 


with compact support in I”. Then there exists e >0 such that the sup- 
port of f is contained in the closed cube 


I"(e) = [e, 1 — e]’. 


The following figure illustrates this support in dimension 2. 
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Let w be an (n — 1)-form on I""1, W(x) = W(x,,...,X,-1) such that 


y=1, 


yr 


and w has compact support. Let 
ac)= | FIX j 564s Xn-15 Xs) dx, Av A AXy,-4 
jn-1 


-| S (Kp 5 00+sXp—15 Xp) UX, At A dXq-y- 


I” \(e) 


Note here that we do have the parameter x, coming in at the inductive 
step. Let 


L(x) = f(x) dx, Av N aX,-1 _ 9(X,)W (x4 bere ee »Xn-1)s 
so 


(*) L(x) A dx, = @(x) — g(Xn)W(X) A dx, 


Z H= g(X,) — g(X,) = 9. 


Furthermore, since f has compact support, so does g (look at the figure). 
By induction, there exists an (n — 1)-form yn, of the first n — 1 variables, 
but depending on the parameter x,, that is 


n(x) se n(x4 gorere »Xn-13 Xn) 
such that 


(X45 66 -5Xn—13 Xn) = Ay (X 15 ---.Xp—15 Xp) 


Here d,_, denotes the exterior derivative with respect to the first n— 1 
variables. Then trivially, 


H(X45 -++sX p13 Xq) A AX, = dy_y (X15 --- Xp Xp) A AX, 


= dn(x), 


where dy is now the exterior derivative taken with respect to all n 
variables. Hence finally from equation (*) we obtain 


(+#) (x) = dn(x) *f: I(Xn)W (4, 585 3M) A dx,,. 
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To conclude the proof of Lemma 1.3, it suffices to show that the second 
term on the right of (**) is exact. We are back to a one-variable prob- 
lem. Let 


h(x,) = [- g(t) dt. 


Then dh(x,) = g(x,) dx,, and h has compact support in the interval (0, 1), 
just as in the start of the induction. Then 


A(h(x,)W(X4 ass »Xn-1)) = dh(x,,) A W(x, $956 »Xn—1) 
= (=I 8g (en )WX1, + Xn-1) A aX, 


because dy = 0. Of course we could have carried along the extra param- 
eter all the way through. This concludes the proof of Lemma 1.3. 


We formulate to an immediate consequence of Lemma 1.3 directly on 
the manifold. 


Lemma 1.5. Let U be an open subset of X, isomorphic to I". Let 
Ww € "(U) be such that 

[ veo 

U 


Let we "(U). Then there exists ce R and yn € "1(U) such that 


@— cw = dy. 


Proof. We take c= | 


U 


o / | yw and apply Lemma 1.3 to w — cw. 
U 


Observe that the hypothesis of connectedness has not yet entered the 
picture. The preceding lemmas were purely local. We now globalize. 


Lemma 1.6. Assume that X is connected and oriented. Let U, W be as 
in Lemma 1.5. Let V be the set of points x € X having the following 
property. There exists a neighborhood U(x) of x isomorphic to I" such 
that for every we S"(U(x)) there exist ceR and ne "'(X) such 
that 

@ — cw = dy. 
Then V = X. 


Proof. Lemma 1.5 asserts that V>U. Since X is connected, it 
suffices to prove that V is both open and closed. It is immediate from 
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the definition of V that V is open, so there remains to prove its closure. 
Let z be in the closure of V. Let W be a neighborhood of z isomorphic 
to I". There exists a point xe VOW. There exists a neighborhood 
U(x) as in the definition of V such that U(x) c W. For instance, we may 
take 

U(x) © (a,, b1) x ++: x (a,, 5,) & I" 


with a; sufficiently close to 0 and b, sufficiently close to 1, and of course 
0 <a,<)b, for i=1,...,n. Let W, €."(U(x)) be such that 


w=. 


U(x) 


Let we ."(W). By the definition of V, there exist c, ER and 7, € W(X) 
such that 


W, — cw =dn,. 


By Lemma 1.5, there exists c, e R and ny, € %3(X) such that 


@ — C,W, = dnp. 
Then 


@ — c2¢,W = d(y2 + C2), 
thus concluding the proof of Lemma 1.6. 


We have now reached the final step in the proof of Theorem 5.2, 
namely we first fix a form we V(U) with U x I" and w #0. Let 


Xx 
w€ .W"(X). It suffices to prove that there exist ce R and ne .""'(X) 
such that 
@ — cw = dn. 


Let K be the compact support of w. Cover K be a finite number of 
open neighborhoods U(x,), ...,U(x,,) satisfying the property of Lemma 
1.6. Let {g;} be a partition of unity subordinated to this covering, so 
that we can write 


a=) go. 


Then each form g;@ has support in some U(x,). Hence by Lemma 5.6, 
there exist c; € R and n; € o£" 1(X) such that 


Q,o — cy = dn;, 


whence w — cy =dn, with c=)\c; and n=)\n;. This concludes the 
proof of Theorems 1.1 and 1.2. 
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XIll, §2. MOSER’S THEOREM 


We return here to the techniques of proof in Chapter V, as for Poincaré’s 
lemma, Theorem 5.1 and Darboux’s Theorem 7.3 of that chapter. How- 
ever, we now have a similar theorem in the context of integration. 

We first make the general comment, similar to the one we made 
previously, for general forms. Let E be a Banach space, and let w be an 
r-multilinear alternating form on E (so R-valued). We say that o is 
non-singular if for each vector v € E, defining w, by 


Dy! (Vy 5 «+ +5Vp—1 E> (U, 015 «+ 5D, 1) 


the map vt-q, is a toplinear isomorphismm between E and L’*(E). We 
previously considered bilinear forms, in Chapter V, §6. 

We can globalize the notion to a manifold, so a form we .#"(X) is 
called non-singular if w(x) is non-singular for each x. It is clear that in 
the finite dimensional case, a volume form is non-singular. With this 
globalization, we obtain: 


Proposition 2.1. Let w be a non-singular r-form on X. Given a form 
ne A" 4(X), there exists a unique vector field — such that 


@aoé=y. 
We could also write the relation with the contraction notation, i.e. 


C0 = 7. 
We now come to Moser’s theorem [Mo 65]. 


Theorem 2.2. Let X be a compact, connected oriented manifold of 
dimension n. Let @, W € of"(X) (= "(X)) be volume forms such that 


[o-fs 


Then there exists an automorphism f: X > X of X such that w = f*y. 


Proof. Let 
o,=(1—-—s)m + sw for OSsS1. 


Then w, is a volume form for each s, and in particular is non-singular. 
By Theorem 1.1, there exists 4 ¢."-1(X) such that yw —w=dy. Note 
also that w — w = dw,/ds. Since w, is non-singular, there exists a unique 
vector field €, such that 


@,°¢,= YN. 
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Let «, be the flow of é,. Then «, is defined on R x X by Corollary 2.4 
of Chapter IV. Then we get: 


dw 
* s 
u=s - es ( ds ) 


= a*d(w, o €,) + a*(’ — w) by Proposition 5.2 of Chapter V 


d 


d 
* (y* = * 
{,082:) = = (ato,) 


= —a* dyn + a* dn 
= 0. 


Therefore «*w, is constant as a function of s, so we find 
@ = at@y = ato, = f*y, with f=o,, 


thereby proving the theorem. 


XI, §3. THE DIVERGENCE THEOREM 


Let X be an oriented manifold of dimension n possibly with boundary, and 
let Q be an n-form on X. Let & be a vector field on X. Then dQ =0, 
and hence the basic formula for the Lie derivative (Chapter V, Proposi- 
tion 5.3) shows that 

LQ = d(Qo €). 


Consequently in this case, Stokes’ theorem yields: 


Theorem 3.1 (Divergence Theorem). 


Remark. Even if the manifold is not orientable, it is possible to use 
the notion of density to formulate a Stokes theorem for densities. Cf. 
Loomis-Sternberg [Los 68] for the formulation, due to Rasala. How- 
ever, this formulation reduces at once to a local question (using parti- 
tions of unity on densities). Since locally every manifold is orientable, 
and a density then amounts to a differential form, this more general 
formulation again reduces to the standard one on an orientable manifold. 


Suppose that (X,g) is a Riemannian manifold, assumed oriented for 
simplicity. We let Q or vol, be the volume form defined in Chapter X, 
§1. Let w be the canonical Riemannian volume form on @X for the 
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metric induced by g on the boundary. Let n, be a unit vector in the 
tangent space T,(X) such that n, is perpendicular to T,(@X). Such a unit 
vector is determined up to sign. Denote by ny its dual functional, i.e. the 
component on the projection along n,. We select n, with the sign such 
that 

ny A (x) = Q(x). 


We then shall call n, the unit outward normal vector to the boundary at 
x. In an oriented chart, it looks like this. 


Then by formula CON 3 of Chapter V, §5 we find 
Qoe =<, ow —nY A (wo), 
and the restriction of this form to 0X is simply <n, €>w. Thus we get: 
Theorem 3.2 (Gauss Theorem). Let X be a Riemannian manifold. Let 
@ be the canonical Riemannian volume form on 0X and let Q be the 
canonical Riemannian volume form on X itself. Let n be the unit out- 


ward normal vector field to the boundary, and let € be a C! vector 
field on X, with compact support. Then 


| (divg 6)Q = | <n, >a. 
x ax 


The next thing is to show that the map d* from Chapter X, §1 is the 
adjoint for a scalar product defined by integration. First we expand 
slightly the formalism of d* for this application. Recall that for any 
vector field ¢, the divergence of € is defined by the property 


(1) d(vol, o ¢) = (div €) vol,. 
Note the trivial derivation formula for a function 9: 


(2) div(pc) = @ div ¢ + (de)(¢). 
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If A is a 1-form, ie. in PL1(TX) = .1(X), we have the corresponding 
vector field €, = AY uniquely determined by the condition that 


(E45 Mq = An) for all vector fields 7. 
For a 1-form /, we define the operator 
d*: oJ'(X) > A(X) = R(X) by d*j = —div ,, 
so by (1), 
(3) (d*A) vol, = d(vol, © €,). 
We get a formula analogous to (2) for d*, namely 
(4) d*(pd) = pd*d — dg, A). 


Indeed, d*(pd) = —div ¢,, = —div(g¢,) = —odiv¢, —(d@)(C,) by (2), 
which proves the formula. 


Let 4, w € W1(TX). We define the scalar product via duality 


<A, >, = (Cas Cay: 
Then for a function @ we have the formula 
(5) <dg, A>, vol, = (pd*A) vol, — d(vol, o @é;). 
Indeed, 
<dg, 4), vol, = [pd*4 — d*(p)] vol, by (4) 
= (pd*A) vol, — d(vol, © pé,) by (3) 
thus proving (5). Note that the congruence of the two forms <dgq, 4), vol, 
and (gd*4) vol, modulo exact forms is significant, and is designed for 
Proposition 3.3 below. 
Observe that the scalar product between two forms above is a func- 
tion, which when multiplied by the volume form vol, may be integrated 


over X. Thus we define the global scalar product on 1-forms with com- 
pact support to be 


<A, O>(x,9) = <A, @)>x a | <A, DY, vol,. 
Xx 


Applying Stokes’ theorem, we then find: 
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Proposition 3.3. Let (X,g) be a Riemannian manifold, oriented and 
without boundary. Then d* is the adjoint of d with respect to the global 
scalar product, i.e. 


(dg, Ax a <Q, d*))y. 
We define the Laplacian (operating on functions) to be the operator 
A = d*d. 


For the Laplacian operating on higher degree forms, we shall give the 
expression d*d+dd* in the next section, but here for functions, the 
second term disappears. 

For a manifold with boundary, we define the normal derivative of a 
function @ to be the function on the boundary given by 


On? Tae <n, Cage = <n, grad, Pq: 


Theorem 3.4 (Green’s Formula). Let (X,g) be an oriented Riemannian 
manifold possibly with boundary, and let o, W be functions on X with 
compact support. Let w be the canonical volume form associated with 
the induced metric on the boundary. Then 


[, (pA — WAg) vol, = -| (POaW — Woqp)o. 
Proof. From formula (4) we get 


d*(p dp) = pAw — (dg, dp>,, 
whence 


pAy — pAg = d*(o dy) — d*(p dg) 
= —div(9 dp) + div(W do). 


We apply Theorem 3.2 to conclude the proof. 


Remark. Of course, if X has no boundary in Theorem 3.7, then the 
integral or the left side is equal to 0. 


Corollary 3.5 (E. Hopf). Let X be a Riemannian manifold without 
boundary, and let f be a C? function on X with compact support, such 
that Af =0. Then f is constant. In particular, every harmonic function 
with compact support is constant. 
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Proof. We first give the proof assuming that X is oriented. By Green’s 
formula we get 


| Af vol, = 0. 
Xx 


Since Af = 0, it follows that in fact Af =0, so we are reduced to the 
harmonic case. We now apply Green’s formula to f?, and get 


0= | Af? vol, = | 2f Af vol, — | 2(grad f)* vol,. 
x x x 


Hence (grad f)? =0 because Af = 0, and finally grad f=0, so df =0 
and f is constant, thus proving the corollary in the oriented case. For 
the non-oriented case, by Proposition 4.6 of Chapter XI, there exists a 
covering of degree 2 of X which is oriented, and then one can pull back 
all the objects from X to this covering to conclude the proof in this case. 


XIll, §4. THE ADJOINT OF d FOR 
HIGHER DEGREE FORMS 


We extend the results of the preceding section to arbitrary forms. Given 
the vector space V of dimension n over R, with a positive definite scalar 
product g, we note that the exterior powers /V V are self dual, with a 
positive definite scalar product such that 


(0) Av A U,, Wy A+ A W,)F > detcy;, W>,- 


We defined the notion of orientation on V in Chapter X, §5, and we now 
assume that V is oriented. 


Proposition 4.1. Given 1 <r <n, there exists a unique isomorphism 
ax NVONTYV 
such that for o, we’ V we have 


<Q, W>, vol, = @ A *. 


Proof. The proof will give an explicit determination of the isomor- 
phism on the usual basis for A’ V. Let I =[i, <i, <:::<i,] be an 
ordered set of r indices. We let {e,,...,¢,} be orthonormal oriented, and 
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If I’ is another such ordered set with n—r elements, and JUI'= 
{1,...,n} then we let €, be the sign of the permutation (I, J) of (1, ...,n). 
We then define 


¥#e; = E77, 


and extend this operation by linearity to all of A’ V. Then directly from 
the definition, we see that if J is an ordered set of r indices, then 


ey A *ey = (l7, 0p ey Ao A, 


= 07y1 A A ey. 


Thus on the standard basis elements of A” V the desired relation of the 
proposition is satisfied. The same relation is therefore satisfied for all 
elements of /(\ V, as desired. 


We define the operator w on the direct sum @, /\ V to have the 
effect 
w= (— 1 on N V. 


Proposition 4.2. We have «w= wx. If n is even, then w=(—1)" on 
INV. Furthermore, +** = w. 


Proof. Direct, simple computations. 


We now apply the above to a Riemannian manifold X of dimension n, 
and to real differential forms. We let 


A(X) 
be the space of C® differential forms of degree r, with compact support 
on the manifold. At each point x eX, we use the space V = T,Y (the 
dual space of the tangent space). The usual operator 


d: f"(X) > of'**(X) 


is R-linear. By Stokes’ theorem, if @ has compact support, then 


| da = 0. 
xX 


We shall give an application of this fact in a Riemannian context. We 
have the volume form vol, (which does not necessarily have compact 
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support) and we define a scalar product on .!(X) by the formula 


C9, Wx,g = <P Wx = [, Qray= [, <9, W>, vol,, 


where we usually omit the index g and merely write X as in <¢, Wy. 


Proposition 4.3. The exterior derivative d has an adjoint d* with respect 
to the scalar product < , dy, namely for g € "*(X) and WE A(X) we 
have 


(dQ, Wx = CP, Fx. 
Furthermore, the adjoint is given by the explicit formula 


* = (—1)""* ede on G(X). 


= —#d* if n is even. 


Proof. By Stokes’ theorem, we have: 
| dp nw -| d(p a (9 | o a day 
xX XxX X 


=(-17 | gn dey. 
xX 


Now 
(19 A dey =(—I'9 A wewdey 


=(— 19 A wa(+d)p 
= (—1"""@ A Hada), 


which proves the proposition. 


XIll, §5. CAUCHY’S THEOREM 


It is possible to define a complex analytic (analytic, for short) manifold, 
using open sets in C” and charts such that the transition mappings are 
analytic. Since analytic maps are C”, we see that we get a C” manifold, 
but with an additional structure, and we call such a manifold complex 
analytic. It is verified at once that the analytic charts of such a manifold 
define an orientation. Indeed, under a complex analytic change of charts, 
the Jacobian changes by a complex number times its complex conjugate, 
so changes by a positive real number. 
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If z,, ...,Z, are the complex coordinates of C”, then 


(Zig sesh e525 -Z 7986052) 


can be used a C® local coordinates, viewing C" as R*". If z, =x, + iy, 
then 
dz, = dx, + i dy, and dz, = dx, — idy,. 


Differential forms can then be expressed in terms of wedge products of 
the dz, and dz,. For instance 


dz, A dz, = 2idy, A dx,. 


The complex standard expression for a differential form is then 


(2) = Y py, p(z) dz;, A+++ A dz;, A dz), A+++ A dZ,.. 
cm) 
Under an analytic change of coordinates, one sees that the numbers r 
and s remain unchanged, and that if s=0 in one analytic chart, then 
s =0 in any other analytic chart. Similarly for r. Thus we can speak of 
a form of type (r,s). A form is said to be analytic if s = 0, that is if it is 
of type (r, 0). 

We can decompose the exterior derivative d into two components. 
Namely, we note that if w is of type (r,s), then dw is a sum of forms of 
type (r + 1, s) and (r,s + 1), say 


da = (da) 41,5) + (da), 5+1)- 
We define _ 
60 = (do) y+41,s) and 0@ = (dod), 541): 


In terms of local coordinates, it is then easy to verify that if w is 
decomposable, and is expressed as 


(z) = (2) dz;, \ +++ A dz, A dZ;,, A+: A dz, = QO, 
then 
do =) ea AO 
az, * 


and 


In particular, we have 


PMP), pe Stee 
dz, 2\dx, OV, Gz, 2\dx,  dy,/° 
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(Warning: Note the position of the plus and minus signs in these 
expressions.) 


Thus we have 


and operating with @ or 6 follows rules similar to the rules for operating 
with d. 

Note that f is analytic if and only if df =0. Similarly, we say that a 
differential form is analytic if in its standard expression, the functions 
Qi, are analytic and the form is of type (r,0), that is there are no dz, 
present. Equivalently, this amounts to saying that dw = 0. The following 
extension of Cauchy’s theorem to several variables is due to Martinelli. 

We let |z| be the euclidean norm, 


\z| = (242, + °° + 2,2,)'?. 


Theorem 5.1 (Cauchy’s Theorem). Let f be analytic on an open set in 
C" containing the closed ball of radius R centered at a point ¢. Let 


@,(Z) = dz, A+++ A dz, \ dZ, °°" nr di, Act A dZ, 
and 


(z) = 2 (—1)*Z,,(2). 
Let Sp be the sphere of radius R centered at €. Then 


n— 1)! f(2) 


ee 
A) = nae | eat 


pel? —¢) 


where &(n) = (— yet, 


Proof. We may assume ( = 0. First note that 


Booz) = ¥ (— 1k dz A c(z) =(—1)"*!n dz a dz, 


k=1 


where dz = dz, A--: A dz, and similarly for dz. Next, observe that if 


(2) = 


Zz 
pn) 


then 
dy =0. 


This is easily seen. On the one hand, dW =0 because w already has 
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dz, \-*: A dz,, and any further dz; wedged with this gives 0. On the 
other hand, since f is analytic, we find that 


BW(2) = f( 8(7) - 0 


by the rule for differentiating a product and a trivial computation. 
Therefore, by Stokes’ theorem, applied to the annulus between two 
spheres, for any r with O <r< R we get 


[+ [o-s 
Sr S,. 


| se) - | p22) 
Sr S, 


\z|?" 


or in other words, 


=e | flo(2) 
S, 


Using Stokes’ theorem once more, and the fact that dw = 0, we see that 
this is 
1 
=z 5( foo) = =|, f Be. 


r 


We can write f(z) = f(0) + g(z), where g(z) tends to 0 as z tends to 0. 
Thus in taking the limit as r>0, we may replace f by f(0). Hence our 
last expression has the same limit as 


1 = 1 
fOr i 6a = I Orn {, (—1)"*'ndz a dz. 


But 
dz a dz = (—1)" VP ind" dy, A dx, A+++ A dy, A AX_- 
Interchanging dy, and dx, to get the proper orientation gives another 


contribution of (— 1)", together with the form giving Lebesgue measure. 
Hence our expression is equal to 


SO)(- Ire n(2i)" 5, i V(B,), 


where V(B,) is the Lebesgue volume of the ball of radius r in R?", and is 
classically known to be equal to x"r?"/n!. Thus finally we see that our 
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expression is equal to 


(2niy" 
(n— 1)! 


f((—1yen? 


This proves Cauchy’s theorem. 


XIll, §6. THE RESIDUE THEOREM 


Let f be an analytic function in an open set U of C”. The set of zeros of 
f is called a divisor, which we denote by V= V,. In the neighborhood of 
a regular point a, that is a point where f(a)=0 but some complex 
partial derivative of f is not zero, the set V is a complex submanifold of 
U. In fact, if, say, D, f(a) # 0, then the map 


(245-0652) (Z15 «+ +5Zn-1> f(2)) 


gives a local analytic chart (analytic isomorphism) in a neighborhood of 
a. Thus we may use f as the last coordinate, and locally V is simply 
obtained by the projection on the set f =0. This is a special case of the 
complex analytic inverse function theorem. 

It is always true that the function log|f| is locally in #1. We give the 
proof only in the neighborhood of a regular point a. In this case, we can 
change f by a chart (which is known as a change-of-variable formula), 
and we may therefore assume that f(z) = z,. Then log|f| = log|z,|, and 
the Lebesgue integral decomposes into a simple product integral, which 
reduces our problem to the case of one variable, that is to the fact that 
log|z| is locally integrable near 0 in the ordinary complex plane. Writing 
z=re", our assertion is obvious since the function rlogr is locally 
integrable near 0 on the real line. 


Note. In a neighborhood of a singular point, the fastest way and 
formally clearest, is to invoke Hironaka’s resolution of singularities, which 
reduces the question to the non-singular case. 

For the next theorem, it is convenient to let 

c=" 6-3 
Ani P 
Note that 


The advantage of dealing with d and d° is that they are real operators. 
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The next theorem, whose proof consists of repeated applications of 
Stokes’ theorem, is due to Poincaré. It relates integration on V and U 
by a suitable kernel. 


Theorem 6.1 (Residue Theorem). Let f be analytic on an open set U of 
C" and let V be its divisor of zeros in U. Let w be a C® form with 
compact support in U, of degree 2n — 2 and type (n—1,n— 1). Then 


| v =| log| f dd. 
Vv U 


(As usual, the integral on the left is the integral of the restriction of w 
to V, and by definition, it is taken over the regular points of V.) 


Proof. Since y and dd‘y have compact support, the theorem is local 
(using partitions of unity). We give the proof only in the neighborhood 
of a regular point. Therefore we may assume that U is selected sufficiently 
small so that every point of the divisor of f in U is regular, and such 
that, for small ¢, the set of points 


U, = {z€ U, |f(z)| 2 e} 
is a submanifold with boundary in U. The boundary of U, is then the 
set of points z such that | f(z)|=«. (Actually to make this set a sub- 
manifold we only need to select ¢ to be a regular value, which can be 
done for arbitrarily small ¢ by Sard’s theorem.) For convenience we let 


S, be the boundary of U,, that is the set of points z such that | f(z)| = «. 
Since log|f| is locally in #', it follows that 


| lol flaa'y = tim | log| f|dd‘w. 
U, €>0 JU, 
Using the trivial identity 


d(log| f|d°p) = d log|f| » d°w + log| f\dd*y, 


we conclude by Stokes’ theorem that this limit is equal to 


lim | log| f|d°y -| dlog|f| A av], 
e670 S, U; 


The first integral under the limit sign approaches 0. Indeed, we may 
assume that f(z) =z, = re. On S, we have |f(z)| =, so log|f| = log. 
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There exist forms w,, w2 in the first n — 1 variables such that 
dw =, a dz,+ 2 a dz,, 
and the restriction of dz, to S, is equal to 
sie® dd, 


with a similar expression for dz,. Hence our boundary integral is of type 


é log a o, 
S. 


where @ is a bounded form. From this it is clear that the limit is 0. 
Now we compute the second integral. Since w is assumed to be of 
type (n — 1,n — 1) it follows that for any function g, 


égndW=0 and dgadw=0. 
Replacing d and d° by their values in terms of 0 and 4, it follows that 
-| dlog|f| a d°w -| d log|f| a dw. 
U, U, 


We have 
d(d‘ log|f| \ w) = dd‘ log|f| 0 w — d‘log|f| a dy. 


Furthermore dd‘ is a constant times 00, and dd‘ log|f|? =0 in any open 
set where f 4 0, because 


€0 log| f|? = dd(log f + log f) = 0 


since 0 log f = 0 and 6 log f = 0 by the local analyticity of log f. Hence 
we obtain the following values for the second integral by Stokes: 


| d log|f |? nay = | d‘ log|f|? a w. 
U, S, 


Since 


a logifP = — 2 (@ — Slog f + log) 


(always assuming f(z) = z,), we conclude that if z, = re’, then the re- 
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striction of d‘ log|f|? to S, is given by 


dé 
tess d° log f = =e 


Now write & in the form 


Y=+2 


where W, contains only dz,, dz; for j = 1, ...,.n — 1 and w, contains dz, or 
dz,. Then the restriction of y, to S, contains d0, and consequently 


ic 2 _ a0 
{, d log|f/? A w= I, an (W,|S,). 


The integral over S, decomposes into a product integral, we respect to 
the first n — 1 variables, and with respect to dO. Let 


(n—1) 
| W1(2)|S, aa g(Z,). 


Then simply by the continuity of g we get 


2n 


1 : 
lim — g(ee"*) dO = g(0). 
2n Jo 


e>0 


Hence 


. dé 
tim I, an A (Wi |S.) = ie W- 


But the restriction of y, to the set z,=0 (which is precisely V) is the 
same as the restriction of y to V. This proves the residue theorem. 


APPENDIX 


The Spectral Theorem 


The following is a set of notes from a seminar of Von Neumann around 
1950. 


APP., §1. HILBERT SPACE 


Let E be a vector space over C (The real theory follows exactly the 
same pattern.) By an inner product on E we mean an R-bilinear pairing 
<x, y>€C of Ex E into C such that, for all complex numbers a, we 
have: 


Cax, yo =aCX, VY), <X YO =<KY, xX), 


<x, x> = 0 and equals 0 if and only if x = 0. 
We have the Schwartz inequality: 


Ix, yl? S <x, x><y, y> 


whose proof is as follows. For all «, 6 complex, 


0S (ax + By, ax + By> = a(x, x) + BaXx, y> + aB<x, y> + BBXY, y>. 


We let « = <y, y> and B = —<x, y>. The inequality drops out. 

We define the norm of a vector x to be <x, x>*? and denote it by |x|. 
Using the Schwrtz inequality, one sees that |x| defines a metric on E, the 
distance between x and y being |x — y|. The norm is continuous. 

We write x | y and say that x is perpendicular to y if <x, y> = 0. 

. The following identities are useful and trivially proved. 
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Parallelogram Law. |x + y|? + |x — y|? = 2|x|? + 2|y|?. 
Pythagoras Theorem. If x 1 y, then |x + y|? = |x|? 4+ |y|?. 


A Hilbert space is an inner product space which is complete under the 
induced metric. For the rest of this appendix, a subspace will always 
mean a closed subspace, with its structure of Hilbert space induced by 
that of E. 


Lemma 1.1. Let F be a subspace of E, let x € E, and let 
a = inf|x — y| 


the inf taken over all ye F. Then there exists an element yy¢¥F such 
that a = |x — yol. 


Proof. Let y, be a sequence in F such that |y, —x| tends to a. We 
must show that y, is Cauchy. By the parallelogram law, 


ln ~ ya = 21Vn AT x|? + 21Ym = x|? i 4120. + Ym) 7 x" 
S2|ye=— xP +2 ¥q = xP? — 4a" 


which shows that y, is Cauchy, converging to some vector y 9. The 
lemma follows by continuity. 


Theorem 1.2. If F is a subspace properly contained in E, then there 
exists a vector z in E which is perpendicular to F (and # 0). 


Proof. Let x €E and x ¢F. Let yo be an element of F which is at 
minimal distance from x (use Lemma 1.1). Let a be this distance and let 


Z= yo —x. After a translation, we may assume that z = x, so that |x| = 
a. For any complex number « and ye F we have |x + ay| = a, whence 


Kx + ay, x + ay) = |x|? + Hx, y> + a<x, y> + a&|yl? 
>a’, 


Put « = t<x, y>. We get a contradiction for small values of t. 


APP., §2. FUNCTIONALS AND OPERATORS 


A linear map A from a Hilbert space E to a Hilbert space H is bounded 
if there exists a positive real number «@ such that 


|Ax| < a|x| 


for all x € E. The norm of A, denoted by |A| is the inf of all such «. 
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Proposition 2.1. A linear map is bounded if and only if it maps the unit 
sphere on a bounded subset, if and only if it is continuous. 


Proof. Clear. 


A functional is a continuous linear map into C. Functionals are 
bounded. We have the fundamental: 


Representation Theorem. A linear map 1:E-—C is bounded if and only 
if there exists ye E such that A(x) = <x, y> for all xEE. If such a y 
exists, it is unique. 


Proof. If A(x) = <x, y> then the Schwartz inequality shows that it is 
bounded, with bound |y|. It is obvious that y is unique. 

Conversely, let 2 be bounded. Let F be the kernel of 4. Then F is a 
subspace. If E = F then everything is trivial. If E 4 F, then there exists 
zeF, z¢€E such that z is perpendicular to F by Theorem 1.2. We 
contend that some multiple y = az does it. A necessary condition on a is 
that 

(z, az) = @|z|?. 


This is also sufficient. Namely, x — (A(x)|A(z))z lies in F. Put a= 
A(z)/|z|?.. Then one sees at once that 4(x) = <x, y> as was to be shown. 


By an operator we shall always mean a continuous linear map of a 
space into itself. 


It is straightforward to show that operators form a Banach space, and 
in fact a normed ring. In other words, in addition to the Banach space 
property, we have 

|AB| S| A||Bl. 


Proposition 2.2. If A is an operator and <Ax,x> =0 for all x, then 
A=0O. 


Proof. This follows from the polarization identity, 
(A(x + y), (x + y)> — <A(x — y), (& — Y)> = 2[¢AX, y> + <AY, x]. 
Replace x by ix. Then we get 


«Ax, y> + <Ay, x> = 0, 
i<Ax, y> — i< Ay, x> = 0, 


for all x, y whence <Ax, y> = 0 and A =O. 
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The above proposition is valid only in the complex case. 

In the real case, we shall need it only when A is symmetric (see 
below), in which case it is equally clear. A similar remark applies to the 
next result. 


Lemma 2.3. Let A be an operator, and c a number such that 
|< Ax, x>| S e|x/? 
for all x €E. Then for all x, y we have 
|<Ax, y>| + Xx, Ay>| S 2c|xI lyl- 
Proof. By the polarization identity, 


2|<Ax, y> + <Ay, x>| S elx + yl? + c|x — yl? = 2c(|x|? + |y|?). 


Hence 
|< Ax, y> + <Ay, x>| S (|x|? + ly|?). 


We multiply y by e and thus get on the left-hand side 
|e~*<Ax, y> + el? CAy, x>|. 


The right-hand side remains unchanged, and for suitable 0, the left-hand 
side becomes 


|<Ax, y>| + |<Ay, x I. 


(In other words, we are lining up two complex numbers by rotating one 
by 6 and the other by —6.) Next we replace x by tx and y by y/t for t 
real and t>0. Then the left-hand side remains unchanged, while the 
right-hand side becomes 


1 
g(t) = t? |x|? + lvl. 


The point at which g’(t) = 0 is the unique minimum, and at this point ty 
we find that 

(to) = |xIlyl- 
This proves our lemma. 
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In our applications, we need the lemma only when A is self-adjoint 
(ie. symmetric, see below), in which case it is even more trivial. 

For fixed y, the function of x given by <Ax,y> is a functional 
(bounded because of the Schwartz inequality). Hence by the representa- 
tion theorem, there exists an element y* such that (Ax, y> = <x, y*> for 
all x. We define A*, the adjoint of A, by letting A*y = y*. Since y* is 
unique, we see that A* is the unique operator such that 


<Ax, y> = <x, A*y) 
for all x, y in E. 


Theorem 2.4. We have: 


(A + B)* = A* + B*, A** = A, 
(aA)* = xA*, |A*| = |Al, 
(AB)* = B*A*, |AA*| = |A|?. 


and the mapping A — A* is continuous. 


Proof. Exercise for the reader. 


APP., §3. HERMITIAN OPERATORS 


We shall say that an operator A is symmetric (or hermitian) if A = A*. 
Proposition 3.1. A is hermitian if and only if <Ax, x) is real for all x. 


Proof. Let A be hermitian. Then <Ax, x> = <x, Ax> = (Ax, x). Con- 
versely, <Ax, x> = (Ax, x> = (x, Ax> = (A*x, x> implies that 


«(A — A*)x, x> =0 
whence A = A* by polarization. 


Proposition 3.2. Let A be a hermitian operator. Then |A| is the 
greatest lower bound of all values c such that 


|< Ax, x>| S ¢|x/? 


for all x, or equivalently, the sup of all values |« Ax, x>| taken for x on 
the unit sphere in E. 


Proof. When A is hermitian we obtain 


|<Ax, y>| S clxl ly 
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for all x, ye E, so that we get |A| <c in Lemma 2.3. On the other hand, 
c=|A| is certainly a possible value for c by the Schwartz inequality. 
This proves our proposition. 


Proposition 3.2 allows us to define an ordering in the space of hermi- 
tian operators. If A is hermitian, we define A =O and say that A is 
semipositive if «Ax, x> = 0 for all xe E. If A, B are hermitian we define 
A=Bif A—Bz=O. This is indeed an ordering; the usual rules hold: If 
A, = B, and A, 2 B,, then 


AALS BAB 


If c is a real number = 0 and A =O, then cA =O. So far, however, we 
say nothing about a product of semipositive hermitian operators AB, 
even if AB = BA. We shall deal with this question later. 

Let c be a bound for A. Then |< Ax, x>| < c|x|? and consequently 


-—clsAsScl. 


For simplicity, if « is real, we sometimes write « < A instead of al < A, 
and similarly we write A < f instead of A < BI. If we let 


a= inf <Ax, x> and B = sup <Ax, x), 
Ix|=1 


|x|=1 


then we have 
asA<B, 


and from Proposition 3.1, 
|A| = max(|al, ||). 


Let p be a polynomial with real coefficients, and let A be a hermitian 
operator. Write 
P(t) = a,t" + °°" + do. 
We define 
p(A) = a,A" +++: + dol. 


We let R[A] be the algebra generated over R by A, that is the algebra 
of all operators p(A), where p(t)e_R[t]. We wish to investigate the 
closure of R[A] in the (real) Banach space of all operators. We shall 
show how to represent this closure as a ring of continuous functions on 
some compact subset of the reals. First, we observe that the hermitian 
operators form a closed subspace of L(E, E), and that R[A] is a closed 
subspace of the space of hermitian operators. 
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We can find real numbers a, B such that 
al SAS FI. 


We shall prove that if p is a real polynomial which takes on values 2 0 
on the interval [a, B], then p(A) is a semipositive operator. 
The fundamental theorem is the following. 


Theorem 3.3. Let «a, B be real and oI SASBI. Let p be a real 
polynomial, semipositive in the interval a <t < B. Then p(A) is a semi- 
positive operator. 


Proof. We shall need the following obvious facts. 

If A, B are hermitian, A commutes with B, and A =O, then AB? is 
semipositive. 

If p(t) is quadratic, of type p(t) = t? + at + b and has imaginary roots, 


then 
a 2 a 
j=(t+~ -— 
mr=(+45) +(0-4) 
is a sum of squares. 


A sum of squares times a sum of squares is a sum of squares (if they 
commute). 

If p(t) has a root y in our interval, then the multiplicity of y is even. 

Our theorem now follows from the following purely algebraic 
statement. 


Let o<t<f be a real interval, and p(t) a real polynomial which is 
semipositive in this interval. Then p(t) can be written: 


p(t) = c[y. 0? + Y(t — OF? + ¥ (B-1HQ7) 


where Q? just denotes the square of some polynomial and c is a number 
=> 0. 


In order to prove this, we split p(t) over the real numbers into linear 
and quadratic factors. If a root y is S a, then we write 


(t—y)=(t-—a) + (a—y) 
and note that (a — y) is a square. If a root y is = B, then we write 


(y¥—H)=(y—-B) + (B-1) 


with (y — B) a square. We can then write, after expanding out the fac- 
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torization of p(t), 


p(t) = cLY OF + Y(t — OF + VB - HOE + L(t — YB - OF] 


with some constant c and Q? standing for the square of some polyno- 
mial. Note that c is = 0 since p(t) is semipositive on the interval. Our 
last step reduces the bad last term to the preceding ones by means of the 
identity 


(¢— a (B—t) +(e — a(B — 0? 
B-« , 


(t — a)(B — t) = 


Corollary 3.4. If a < p(t) < b in the interval, then 
al < p(A) SDI. 


Suppose that oI SAFI. If p(t) is a real polynomial, we define as 
usual 
IIpl| = sup |p) 


with t ranging over the interval. 


Corollary 3.5. Let af SAS I. Let p(t) be a real polynomial. Then 
|p(A)| S Ill. 


Proof. Let q(t) = |lp|| + p(t). Then q(t) is 2 0 on the interval. Hence 
q(A) 2 O and our assertion follows at once. 


As usual, we consider the continuous functions on the interval as a 
Banach space. If f is any continuous function on the interval, then by 
the Weierstrass approximation theorem, we can find a sequence of poly- 
nomials {p,} approaching f uniformly on this interval. We define f(A) 
as the limit of p,(A). From Corollary 3.5 we deduce that {p,(A)} is a 
Cauchy sequences, and that its limit does not depend on the choice 
of the sequence {p,}. Furthermore, by continuity, our corollary general- 
izes to continuous functions, so that | f(A)| S || fll. 

We see that the map ft» f(A) is a continuous homomorphism from 
the Banach algebra of continuous functions on the interval into the 
closure of the subalgebra generated by A. 


Proposition 3.6. Let A be a semipositive operator. Then there exists an 
operator B in the closure of the algebra generated by A such that 
B* =A. 


Proof. The continuous function t? maps on A??. 
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Corollary 3.7. The product of two semipositive, commuting hermitian 
operators is again semipositive. 


Proof. Let A, C be hermitian and AC = CA. If B is as in Proposition 
3.6, then 


<ACx, x> = (B*Cx, x> = <BCx, Bx) = <CBx, Bx) 2 0. 


The kernel of our homomorphism from the continuous functions to 
the operators is a closed ideal. Its zeros form a closed set called the 
spectrum of A and denoted by o(A). 


Lemma 3.8. Let X be a compact set, R the ring of continuous functions 
on X, and a a closed ideal of R, a#R. Let C be the closed set of 
zeros of a. Then C is not empty and if a function f € R vanishes on C, 
then f € a. 


Proof. Given ¢, let U be the open set where |f|<e«. Then X — U is 
closed. For each point te X — U there exists a function g€a such that 
g(t) £0 in a neighborhood of t. These neighborhoods cover X — U, and 
so does a finite number of them, with functions g,, ....g,. Let g= 
gz +:+-+ 92. Then gea. Our function g has a minimum on X —U 
and for n large, the function 


ng 
Fi + ng 


is close to f on X — U and is < ¢ on U, which proves what we wanted. 


We now redefine the norm of a continuous function f to be 
Iflla = sup |f(Ol- 
tea(A) 


Theorem 3.9. The map 


Fr f(A) 


induces a Banach-isomorphism (i.e. norm-preserving) of the Banach alge- 
bra of continuous functions on o(A) onto the closure of the algebra 
generated by A. 


Proof. We have already proved that our map is an agebraic isomor- 
phism and that |f(A)| S ||fll4. In order to get the reverse inequality, we 
shall prove: 
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If f(A) 2 O, then f(t) = 0 on the spectrum of A. Indeed, if f(c) < 0 for 
some céa(A), we let g(t) be a function which is 0 outside a small 
neighborhood of c, is 2 0 everywhere, and is >0 at c. Then g(A) 
and g(A)f(A) are both = 0 by Corollary 3.7. But —g(t)f(t)=0 gives 
—g(A)f(A) 2 O whence g(A)f(A) = O. Since g(t)f(t) is not 0 on the spec- 
trum of A, we get a contradiction. 

Let now s=|f(A)|. Then sf —f(A)2=O implies that s— f(t) = 0, 
which proves the theorem. 


From now on, the norm on continuous functions will refer to the 
spectrum. All that remains to do is identify our spectrum with what can 
be called the general spectrum, that is those complex values € such that 
A — ¢ is not invertible. (By invertible, we mean having an inverse which 
is an operator.) 


Theorem 3.10. The general spectrum is compact, and in fact, if & is in 
it, then |€| <|A|. If A is hermitian, then the general spectrum is equal 
to o(A). 


Proof. The complement of the general spectrum is open, because if 
A — €o is invertible, and € is close to &), then (A — &y) 1(A — €) is close 
to I, hence invertible, and hence A — é is also invertible. Furthermore, if 
¢ > Al, then |A/|<1 and hence I —(A/é) is invertible (by the power 
series argument). So is A — € and we are done. Finally, suppose that ¢ 
is in the general spectrum. Then € is real. Otherwise, let 


g(t) = (t — E)(t — 2). 


Then g(t) #0 on o(A) and h(t) = 1/g(t) is its inverse. From this we see 
that A — é is invertible. 

Suppose € is not in the spectrum. Then t — € is invertible and so is 
A-€, 

Suppose ¢ is in the spectrum. After a translation, we may suppose 
that 0 is in the spectrum. Consider the function g(t) as follows: 


_ file Ie 1/N, 
ao 1 It] < 1/N, 


(g is positive and has a peak at 0.) If A is invertible, BA = I, then from 
|tg(t)| S 1 we get |Ag(A)| <1 and hence |g(A)| <|B|. But g(A) becomes 
arbitrarily large as we take N large. Contradiction. 


Theorem 3.11. Let S be a set of operators of the Hilbert space E, 
leaving no closed subspace invariant except 0 and E itself. Let A be a 
Hermitian operator such that AB = BA for all Be S. Then A=AlI for 
some real number 4. 
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Proof. It will suffice to prove that there is only one element in the 
spectrum of A. Suppose there are two, 1, #4,. There exist continuous 
functions f, g on the spectrum such that neither is 0 on the spectrum, 
but fg is 0 on the spectrum. For instance, one may take for f, g the 
functions whose graph is indicated on the next diagram. 


Ml a) 


We have f(A)B = Bf(A) for all Be S (because B commutes with real 
polynomials in A, hence with their limits). Hence f(A)E is invariant 
under S because 


BS(A)E = f(A)BE c f(A)E. 


Let F be the closure of f(A)E. Then F #0 because f(A) # O. Further- 
more, F #4 E because g(A)f(A)E =0 and hence g(A)F =0. Since F is 
obviously invariant under S, we have a contradiction. 


Corollary 3.12. Let S be a set of operators of the Hilbert space E, 
leaving no closed subspace invariant except 0 and E itself. Let A be 
an operator such that AA* = A*A, AB = BA, and A*B = BA* for all 
BeS. Then A=AlI for some complex number A. 


Proof. Write A = A, + iA, where A,, A, are hermitian and commute 
(e.g. A, =(A + A*)/2). Apply the theorem to each one of A, and A, to 
get the result. 
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